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^ (54) Title: ANALYSIS METHOD 

\o 

^ (57) Abstract: This invention relates to novel methods for the identification of genes and gene products that are implicated in cer- 
^ tain disease states. According to the invention, there is provided a method for the identification of a gene that is implicated in a 

specific disease or physiological condition, said method comprising the steps of comparing: i) the transcriplome or proleome of a 
^ first specialised cell type that is implicated in the disease or condition under first and second experimental conditions; with ii) the 

transcriplome or proteome of a second specialised cell type under said first and said second experimental conditions; and identifying 
Q as a gene implicated in the disease or physiological condition, a gene that is differentially regulated in the two specialised cell types 
^ under the first and second experimental conditions. The invention also relates to novel genes and gene products identified using these 
^ methods. 



WO 02/46465 PCT/G BO 1/05458 

1 

Analysis method 

This invention relates to novel methods for the identification of genes and gene products that are 
implicated in certain disease states. The invention also relates to novel genes and gene products identified 
using these methods. 

5 All publications, patents and patent applications cited herein are incorporated in full by reference. 

One of the central goals in the field of gene expression is to understand and elucidate the relationship 
between a particular disease state and the gene expression pattern that defines and/or causes this disease 
state. Research has concentrated on differences in expression patterns between diseased and healthy 
tissues to elucidate the physiological mechanisms of disease. Identified differences in expression patterns 
10 provide putative points for therapeutic intervention to reverse the disease phenotype. These differences 
• also provide markers that are useful for diagnosis, and identify proteins for further investigation as agents 
implicated in the disease in question. 

Conventional methods for the elucidation of mechanisms of disease tend to concentrate on the correlation 
of a disease state with altered levels of a particular protein. Such methods include techniques of 
15 immunohislochemistry, the study of differential mRNA expression and the sequence analysis of particular 
proteins to identify mutations that are associated with a certain disease state. 

Recently, research has concentrated on analysis of the transcriptomes of organisms and cell types that are 
considered to be of scientific interest. By "transcriptome" is meant the exact set of transcripts that are 
expressed in a cell. The emerging field of nucleic acid arrays is one field in which a large number of 

20 powerful tools are being generated for the study of transcriptome variation between different tissue types. 
These tools are based on techniques originally pioneered by Schena et a/., 1995 (Science 270: 467-470) 
and Fodor et al, 1991 (Science 251 , 767-773) and facilitate the evaluation of variations in DNA orRNA 
sequences and of variations in expression levels from tissue samples and allow the identification and 
genotyping of mutations and polymorphisms in these sequences. The power of one such technique has 

25 recently been demonstrated by Perou et al, (Nature, 2000, 406:747-752), who generated molecular 
portraits of the transcriptomes of human breast tumours. 

Over recent years, the so-called "genomics revolution" has allowed access to large portions of whole 
genomes, including the human genome. The amount of sequence information now available considerably 
facilitates the analysis of the results of experiments that aim to elucidate the differences between gene 
30 expression in diseased and healthy tissues. As this information increases in scope and becomes more 
readily available, the study of the molecular mechanism of disease, and the elucidation of techniques for 
combatting these diseases will be considerably facilitated. 

However, there are notable disadvantages associated with all methods that are currently employed for the 
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analysis of human disease. Many methods currently employed utilise established cell lines. Because these 
cells have been manipulated to allow their immortalisation in cell culture, the physiological situation in 
these cells is not considered by the present inventors to be generally representative of the authentic 
situation in equivalent cells in vivo. Furthermore, most of these methods tend to utilise a global strategy 
5 for intervention, often ignoring the intricacies in gene expression that exist between different tissues. 
There thus remains a great need for the establishment of novel methods for the analysis of gene 
expression. 

According to the invention, there is provided a method for the identification of a gene that is implicated in 
a specific disease or physiological condition, said method comprising the steps of: 

10 a) comparing: 

i) the transcriptome or proteome of a first specialised cell type that is implicated in the 
disease or condition under first and second experimental conditions; with 

ii) the transcriptome or proteome of a second specialised cell type under said first and 
said second experimental conditions; and 

15 b) identifying as a gene implicated in the disease or physiological condition, a gene that is 

differentially regulated in the two specialised cell types under the first and second 
experimental conditions. 

Using this method, genes have been identified that respond to perturbations of cell physiology in a cell- 
specific rather than a generic fashion. The method of the invention exhibits significant advantages over 
20 conventional methods of identifying genes that are implicated in disease. 

Various groups have previously investigated mechanisms of physiological regulation, by comparing gene 
expression levels in the presence and absence of a physiological stimulus or challenge. Genes identified 
in a particular cell type as being expressed at different levels under different conditions are implicated as 
components of a pathway that is responsive to the altered conditions, or that is regulated differently under 
25 the altered conditions. However, these methods exhibit a tendency to ignore patterns of gene expression 
that are physiologically relevant. This inclination is considered to result from a prejudice in the art that 
dictates that cells respond to changes in certain physiological conditions in a generic fashion, rather than 
in a cell specific fashion. 

By "implicated in a specific disease or physiological condition" is meant that the gene has been found to 
30 possess a distinct role in a pathway that is involved in susceptibility to, generation of or maintenance of a 
particular disease phenotype or physiological condition. As will be apparent to the skilled reader, any 
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point in any pathway may be the unique point at which a cell departs from the normal physiological 
response and generates a disease phenotype. Often the effect that is manifested as a disease is the result of 
a mutation event, in which a mutation occurs in the sequence of a gene encoding a protein that functions 
in a relevant physiological pathway. 

5 There are numerous examples of diseases and conditions that may be studied using the method of the 
invention. Such pathological conditions include those that result from a change in the intrinsic nature of a 
cell (usually genetic) or from a change in the cellular microenvironment, either of which might be 
recapitulated in a laboratory setting. The methods may be applied to any disease or condition that is 
manifested in, or is generated in a specific cell type. 

10 Examples of such conditions include changes in the cellular microenvironment, exposure to hormones, 
growth factors, cytokines, chemokines, inflammatory agents, toxins, metabolites, pH } pharmaceutical 
agents, hypoxia, anoxia, ischemia, imbalance of any plasma-borne nutrient [including glucose, amino 
acids, co-factors, mineral salts, proteins and lipids], osmotic stress, temperature [hypo and hyper- 
thermia], mechanical stress, irradiation [ionising or non-ionising], cell-extracellular matrix interactions, 

15 cell-cell interactions, accumulations of foreign or pathological extracellular components, intracellular and 
extracellular pathogens [including bacteria, viruses, fungi and mycoplasma] and genetic perturbations 
[both epigenetic or mediated by mutation or polymorphism]. 

Examples of such diseases include cardiovascular disease, atherosclerosis, inflammatory conditions 
(including rheumatoid arthritis), cancer, ischemic disease, asthma, hematopoietic disorders, neurological 
20 diseases including Parkinson's and Alzheimer's diseases, infectious disease and allergies. 

One particular physiological response that has been used herein to illustrate the invention is the cellular 
response to hypoxia. The term "hypoxia" is intended to refer to an environment of reduced oxygen 
tension, as compared to the normal physiological environment for a particular organism, which is termed 
"normoxia". The prejudice in this technical field presents the view that there is a general, ubiquitous 
25 response to hypoxia, mediated primarily at the level of mRNA (transcriptional initiation and post- 
transcriptional stabilisation). 

In a variety of human diseases, cells are exposed to conditions of low oxygen tension, usually as a result 
of poor oxygen supply to the diseased area. For instance, tissue oxygenation plays a significant regulatory 
role in both apoptosis and in angiogenesis (Bouck e\ al, 1996, Adv. Cancer Res. 69:135-174; Bunn et al, 
30 1996, Physiol. Rev. 76:839-885; Dor et al, 1997, Trends Cardiovasc. Med., 7:289-294; Carmeliet et al, 
1998, Nature 394:485-490). Apoptosis (see Duke et al 1996, Sci. American, 80-87 for review) and 
growth arrest occur when cell growth and viability are reduced due to oxygen deprivation. Angiogenesis 
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(i.e. blood vessel growth, vascularization), is stimulated when hypooxygenated cells secrete factors that 
stimulate proliferation and migration of endothelial cells in an attempt to restore oxygen homeostasis (for 
review see Hanahan et al t 1996, Cell, 86:353-364). 

Ischaemic disease pathologies involve a decrease in the blood supply to a bodily organ, tissue or body 
5 part generally caused by constriction or obstruction of the blood vessels. For example, solid tumours 
typically have a disorganised blood supply, leading to hypoxic regions. Other disease conditions 
involving hypoxia include stroke, atherosclerosis, retinopathy, acute renal failure, myocardial infarction, 
stroke and hair Joss. Therefore, apoptosis and angiogenesis as induced by the ischaemic condition are also 
considered to be involved in these disease states. It is generally considered that understanding the 
10 mechanism by which cells respond to these diseases may be the key to the disease pathology and thus 
relevant to disease treatment. 

In a different but related approach, it is now recognised that angiogenesis is necessary for tumour growth 
and that retardation of this process provide a useful tool in controlling malignancy and retinopathies. For 
example, neoangiogenesis is seen in many forms of retinopathy and in tumour growth. The ability to be 
15 able to induce tumourigenic cells to undergo apoptosis is an extremely desirable goal; particularly in the 
cancer field, it has been observed that apoptosis and angiogenesis-related genes provide potent 
therapeutic .targets. It has also been observed that hypoxia plays a critical role in the selection of 
mutations that contribute to more severe tumourigenic phenotypes (Graeber et al, 1996 Nature, 
379(6560):88-91). 

20 Early in the history of this field it was discovered that a transcription factor, HlF-lalpha, is ubiquitously 
present in cells and is responsible for the induction of a number of genes in response to hypoxia. This 
protein is considered a master regulator of oxygen homeostasis (see, for example, Semenza, (1998) Curr. 
Op. Genetics and Dev. 8:588-594). Where HIF1 alpha is genetically knocked out, the hypoxia-inducible 
transcription of virtually all glycolytic enzymes has been shown to be inhibited. Glycolysis is an essential 

25 process which goes on in all mammalian cells. This finding is therefore consistent with previous work 
showing that when cells are exposed to conditions of hypoxia, they up-regulate glycolytic enzymes to 
enable ATP production, since oxidative phosphorylation is no longer feasible under conditions of low 
oxygen (Webster (1987) MoLCelLBiochem, 77: 19-28). Further support for a critical and general role of 
HIFIalpha in the hypoxic response is demonstrated by the knockout mouse, which dies at day 10.5 of 

30 gestation. The same is true of the knockout of the ARNT protein, the dimerisation partner of HIFIalpha. 

For the first time, it is demonstrated herein that different tissues and cell types exhibit a very different 
response to hypoxia, at the level of the induction and repression of gene expression. This has allowed the 
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detailed elucidation of the mechanism of this particular physiological response, so paving the way for the 
development of improved therapeutic agents that target components of the response pathway in particular 
tissues. Although conventional approaches to the analysis of this mechanism have successfully identified 
numerous genes, because of the universal prejudice in the art that these components will all be 
5 induced/repressed similarly in all cell types, all the approaches suggested have hitherto been limited to the 
design of therapeutic agents that act in a global fashion. 

The methods of the present invention therefore extend and add to previous work performed in this field, 
in that the discoveries made now allow the design of agents that target the hypoxic response in specific 
tissues. For example, it is known that brain and heart tissues die very rapidly after ischaemic insult. By 

10 using the method of the invention, it is quite possible that these tissues will be found to share common 
features in their response to hypoxia, that is different from other cell types. This might allow, for 
example, the design of a combination cardioprotective and neuroprotective agent effective against this 
subset of body tissues. Alternatively, the hypoxic response in these tissues might be found to be quite 
different. This information would, then be taken into account when designing therapeutic 

15 countermeasures, in that an agent would be designed for the unique neurological or cardiological tissue 
concerned. 

The method of the invention involves the comparison of the transcriptomes or proteomes of at least two 
specialised cell types under two different physiological or genetic conditions. By "transcriptome" is 
meant the exact set of transcripts that are expressed in a cell. The transcriptome thus has a qualitative 
20 element (the identity of individual gene transcripts) and a quantitative element (the proportion of each 
unique transcript in the total number of individual transcripts present in the cell at a particular moment). 
By "proteome" is meant the exact set of protein molecules that are expressed in a cell. 

By "specialised cell type" is meant a cell type that has a restricted biochemical capacity and that can be 
unambiguously identified as possessing a unique set of biochemical and physiological functions. 
25 Preferably, the specialised cells are primary cells, and not cell lines or whole body tissues. Primary cells 
are cells that cannot proliferate indefinitely in culture. Primary cells can be derived from adult tissue, or 
from embryo tissue that is differentiated in culture to an adult cell or to a precursor of an adult cell that 
displays specialised characteristics. 

Examples of preferred specialised cell types include cardiomyocytes, endothelial cells, sensory neurons, 
30 motor neurons, CNS neurons (all types), astrocytes, glial cells, Schwann cells, mast cells, eosinophils, 
smooth muscle cells, skeletal muscle cells, pericytes, lymphocytes, tumor cells, monocytes, macrophages, 
foamy macrophages, granulocytes, synovial cells / synovial fibroblasts, epithelial cells (varieties from all 
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tissues/ organs). Examples of other suitable specialised cell types include vascular endothelial cells, 
smooth muscle cells (aortic, bronchial, coronary artery, pulmonary artery, etc), skeletal muscle cells, 
cardiomyocyte cells, fibroblasts (many types, such as synovial), keratinocytes, hepatocytes, dendritic 
cells, astrocytes, neurone cells (including mesencephalic, hippocampal, striatal, thalamic, hypothalamic, 
5 olfactory bulb, substantia nigra, locus coeruleus, cortex, dorsal root ganglia, superior cervical ganglia, 
sensory, motor, cerebellar cells), neutrophils, eosinophils, basophils, mast cells, monocytes, macrophage 
cells, erythrocytes, megakaryocytes, hematopoietic progenitor cells, hematopoietic pluripotent stem cells, 
any stem cells, any progenitor cells, epithelial cells, melanocytes, osteoblasts, osteoclasts, stromal cells, 
purkinje cells, T-cells, B -cells, synovial cells, pancreatic islet cells (alpha and beta), leukemia cells, 
10 lymphoma cells, tumour cells, retinal cells, adrenal chromaffin cells. As will be apparent to the skilled 
reader, it is not here possible to provide an exhaustive list of specialised cell types that may be studied 
according to the methods of the present invention. 

Intended as being included within the method of the invention is the possibility of using, as two different 
specialised cell types, two different physiological states of the same cell type, for example, activated and 
15 resting macrophages. 

The transcriptomes of the specialised cell types are compared under different experimental conditions. 
The term "experimental conditions" is used broadly in this context and is intended to embrace any 
physiological or genetic conditions to which a cell type may be exposed. The intention of the method is to 
compare the transcriptomes or proteomes of the cell types under different experimental conditions that 
20 have a physiological relevance. Accordingly, the state of the transcriptome or proteome under one set of 
experimental conditions will generally act as a control against which the transcriptome or proteome may 
be compared under a second set of experimental conditions. Any distinct physiologically-relevant 
conditions may therefore be of interest. 

Examples of suitable physiological experimental conditions include conditions under which the cell is 
25 submitted to a physiological, mechanical, temperature, chemical, toxic or pharmaceutical stress. One 
example is hypoxia, defined herein as a physiological state in which oxygen demand by the cell exceeds 
its supply to the cell. The transcriptome or proteome under this set of experimental conditions may be 
compared to the transcriptome or proteome under conditions of normoxia, when oxygen supply is in 
concordance with the demand by the cell. 

30 The transcriptomes or proteomes may also be compared under different genetic conditions. By "genetic 
conditions" is meant that the genotype of the compared cell populations contains a different genetic 
component. This may be the presence of one or more different, non-endogenous nucleic acid molecules in • 
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the cell, herein referred to collectively as "genetic elements". Such genetic element(s) may potentially be 
incorporated into the genome of the cell, or alternatively may exist as a separate genetic entity, for 
example, as an extra-chromosomal element such as a plasmid or episome. Alternatively, the genome may 
have been perturbed by external intervention, for example, to increase or decrease the expression of a 
5 particular gene or genes. A number of variations on this theme are possible, including the overexpression 
of a genetic element via the administration of the functional gene, the overexpression of a genetic element 
via the administration of a regulator of the functional gene (such as, for example, a transcription factor 
[either natural or artificially constructed via the fusion of a DNA binding domain with an activator 
domain]), the inhibition of the expression of a functional gene (for example, using antisense RNA or 
10 ribozymes), the inhibition of the expression of a functional gene. (for example, using a transdominant 
protein) and the inhibition of the expression of a functional gene (for example, using a repressor protein 
that is either natural or artificially constructed from a DNA binding protein fused to a repressor domain). 

A particular example of a genetic perturbation as envisaged herein, that forms one preferred embodiment 
of the method of the present invention, is the so-called "Smartomics" technology that forms the basis for 

15 co-pending, co-owned International patent application PCT/GB01/00758. According to this technology, a 
heterologous nucleic acid is introduced into a primary cell to augment a specific natural physiological 
response. "Smartomics" may be applied to the current invention by measuring and comparing cellular 
responses to a heterologous gene in two or more distinct cell types, both with and without the natural 
physiological stimulus. Lentivirus technology is used to introduce the heterologous nucleic acid molecule 

20 in such a way that there is negligible perturbation of endogenous gene expression. For this reason, this 
technology exhibits significant benefits over conventional technology of a similar nature, since the prior 
art methods are generally invasive, having downstream effects other than the simple introduction of the 
heterologous nucleic acid molecule. The Smartomics technology allows much more precise 
measurements to be taken of the effect of introducing the heterologous nucleic acid. 

25 The method of the invention allows the identification of genes that are implicated in a specific disease or 
physiological condition. The genes identified in this way are candidate targets for antagonists or agonists 
that modulate disease states pertinent to that specialised cell type. This allows the development of 
selective agonists and antagonists, rather than broad spectrum agonists and antagonists. This approach 
thus adds value in the selective treatment of disease. Furthermore the identified genes are associated with 

30 regulatory elements that provide alternative and additional candidate targets for exploitation for the 
delivery of gene products to that cell in a cell-specific fashion. The genes and regulatory elements 
identified according to the method of the invention can be used directly in therapeutic applications via 
gene therapy, via recombinant protein methods or via chemical mimetics or as targets for the 
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development of agonists and antagonists such as antibodies, small chemical molecules, peptides, 
regulatory nucleic acids. 

The step of comparison of the transcriptomes or proteomes of the first and second specialised cell types 
under first and second experimental conditions may be effected using any approach that allows the 
5 quantitative comparison of gene or protein expression, and a number of such means will be known to 
those of skill in the art. Such experiments have only become possible in recent years, due to certain 
advances in technology that have allowed the large scale, high throughput analysis of gene expression. 

One example of a method that allows the comparison of the transcriptome of a specific cell type with a 
second or subsequent transcriptome involves the generation of a set of clones that represent all the 
10 transcripts expressed in a cell under the conditions in which the cell is maintained. This may be done by 
constructing a cDNA library, in which copies of ail mRNA transcripts expressed in the cell are cloned 
into a suitable vector for subsequent analysis. 

Such libraries may be normalised cDNA libraries, in which the distribution of genes in the library has 
been biased to reduce the number of clones that represent genes with large numbers of transcripts (such 
15 as, for example, beta-actin) and thus reduce the repetitive nature of the library. Normalisation thus acts to 
reduce the frequency of genes expressed at high levels and to enhance the frequency of genes expressed at 
low levels (see de Fatima Bonaldo el al, Genome Research 6: 791-806 (1996)). 

Libraries may also be subtracted cDNA libraries, in which the distribution of genes is manipulated to 
remove genes that are expressed in both mRNA populations used to construct the library. The 
20 commercially-available PCR Select kit (Clontech, Inc) is an example of a system useful to generate such 
libraries. 

cDNA clones generated as reflective of the transcriptome of a specific ceil type may then be amplified, 
and processed to evaluate the identity of the nucleic acid clones. For example, multiple clones may be 
picked and used as template for PCR amplification. The PCR products may then be arrayed onto 
25 membranes or glass slides to create nucleic acid arrays. For expression profiling, these arrays are then 
hybridised to complex nucleic acid probes in order to quantitate the abundance of individual genes 
contained in the probes. 

A recent summary of nucleic acid array technology that is useful in the analysis of the transcriptome of a 
cell population is provided in Nature Genetics, (1999) (21 suppl; 1-61). There are various types of array 
30 technology currently used, including "microarrays", or "chips", which are high density cDNA arrays 
produced on glass slides, often produced using photolithography. A second type of array is the 
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"macroarray", which is an array with sub-millimetre spot-spot distances produced on a nylon membrane. 
One example of this type of array are the nylon-based microarrays sold commercially by Research 
Genetics Inc. (termed Research Genetics Human GeneFilters) that each contain 5,300 cDNA fragments of 
known identity. The whole series of arrays covers some 35,000 cDNA fragments. This particular array 
5 system (and others like it) allow the identification of transcripts that are down-regulated, as well as those 
that are up-regulated, since the range of genes used to manufacture the arrays are not biased. 

The step of comparison may be effected by utilising subtracted cDNA libraries. Using this approach, the 
transcriptome of one specialised cell type under first experimental conditions is subtracted against the 
transcriptome under second experimental conditions. This reveals the differences in expression under the 
10 two experimental conditions tested. When this is performed for both specialised cell types, the differential 
regulation of gene expression under the two experimental conditions is revealed. 

The step of comparison is through the detection of genes that are differentially regulated in the two 
specialised cell types examined under the first and second experimental conditions. As an example, a 
human cardiom yoblast (cell type A) and a human macrophage (cell type B) may be placed at the same 

15 temperature and at a high oxygen tension (first experimental conditions [1]). Cells from the same cell 
types are also incubated at this temperature, yet under conditions of low oxygen tension (second 
experimental conditions [2]). In this simple example, there are then a minimum of four combinations of 
cell type and condition, A[l], B[I], A [2] and B[2J. "Snapshots" are taken of the transcriptomes of both 
cell types under the "normoxic" and the "hypoxic" experimental conditions, by preparing messenger 

20 RNA from all four combinations. Differences in the regulation of genes can then be analysed, for 
example, using a process of subtractive hybridisation. 

The mechanism of transcriptome comparison in the above example may be as follows. Subtracted cDNA 
libraries are separately prepared for hypoxic macrophages and cardiomyoblasts; for both cell types, their 
cDNA under normoxic conditions is subtracted against their cDNA under hypoxic conditions. This might 

25 be effected by harvesting RNA from cells both in normoxia and hypoxia, and preparing cDNA. 
Subtractive hybridization, optionally including suppression PCR, may then be performed to remove genes 
from the hypoxic cell cDNA which are also present in cDNA from normoxic cells. Insert DNA from 
these subtracted libraries can then be amplified and arrayed onto duplicate membranes. Quantitative 
hybridization with pre-library cDNA material (normoxia and hypoxia) then allows the comparison of 

30 differentially-expressed clones in the two cell types. The clones representing hypoxia-inducible genes 
may be then be identified, for example, by sequencing. 



WO 02/46465 



PCT/GBIH/05458 



10 

Other techniques that are suitable for the analysis of the transcriptome of a specific cell type include serial 
analysis of gene expression (SAGE; Velculescu et al, Science (1995) 270; 484-487), Selective 
amplification via biotin- and restriction-mediated enrichment (SABRE) (Lavery et al, (1997), PNAS USA 
94: p683 1 -6836): Differential display (for example, indexing differential display reverse transcriptase 
5 polymerase chain reaction (DDRT-PCR; Mahadeva et al. (1998) J. Moi.Biol. 284, 1391-1398)); 
representational difference analysis (RDA) (Hubank (1999) Methods in Enzymology 303: 325-349); 
differentia] screening of cDNA libraries (see Sagerstrom et al. (1997) Annu. Rev. Biochem. 66: 751-783); 
"Advanced Molecular Biology", R.M. Twyman (1998) Bios Scientific Publishers, Oxford; "Nucleic Acid 
Hybridization", M. L. M .Anderson (1999) Bios Scientific Publishers, Oxford); Northern blotting; RNAse 
10 protection assays; S 1 -nuclease protection assays; RT-PCR; real time RT-PCR (Taq-man); EST 
sequencing; massively parallel signature sequencing (MPSS); and sequencing by hybridisation (SBH) 
(see. Drmanac R. et al (1999), Methods in Enzymology 303:165-178). Many of these techniques are 
reviewed in "Comparative gene-expression analysis" Trends Biotechnol. 1999 Feb;l7(2):73-8. 

Methods such as these have been applied widely ^to study mechanisms of biological response. In 
15 particular, microarrays have been used widely to compare gene expression levels between normal and 
diseased tissue. More typically, however, comparisons are performed to detect changes in gene 
expression that are associated with specific aspects of disease progression or pathology. For instance, a 
study of prostate cancer would examine changes associated with the step-wise progression to full 
malignancy or the dependence on androgens for growth. 

20 Transcriptome analysis is complemented by the analysis of the complete protein make-up of a cell, 
referred to as proteomics. The use of two dimensional SDS-PAGE gels in combination with amino acid 
sequencing by mass spectrometry is currently the most widely-used technique in this field (see 
"Proteomics to study genes and genomes" Akhilesh Pandey and Matthias Mann, (2000), Nature 405: 837- 
846). Additionally, the recent developments in the field of protein and antibody arrays now allow the 

25 simultaneous detection of a large number of proteins. For example, low-density protein arrays on filter 
membranes, such as the universal protein array system (Ge H, (2000) Nucleic Acids Res. 28(2), e3) allow 
imaging of arrayed antigens using standard ELISA techniques and a scanning charge-coupled device 
(CCD) detector. Immuno-sensor arrays have also been developed that enable the simultaneous detection 
of clinical analytes. It is now possible using protein arrays, to profile protein expression in bodily fluids, 

30 such as in sera of healthy or diseased subjects, as well as in patients pre- and post-drug treatment. 

Antibody arrays also facilitate the extensive parallel analysis of numerous proteins that are hypothetically 
implicated in a disease or particular physiological state. A number of methods for the preparation of 
antibody arrays have recently been reported (see CahilL Trends in Biotechnology, 2000 7:47-51). 
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It is not the intention here to review studies that have been conducted in this area previously. However 
one example of a physiological condition that has already received considerable attention is the response 
to hypoxia. Several patent applications have now been published that involve an examination of the 
genetic response to hypoxia (see WOOO/12139, Quark Biotech, Inc.; WO00/12525, Quark Biotech, Inc.; 
5 WO99/09049, Quark Biotech, Inc.; WO99/09046, Quark Biotech, Inc.; W099/48916, The Board of 
Trustees of the- Leland Stanford Jr. University). These patent applications generally utilise methods of 
subtractive hybridisation and differential expression gene microarray analysis to examine this genetic 
response in certain cell lines. The studies have implicated specific genes as being either repressed or 
induced under hypoxic conditions as compared to their expression under normoxic conditions. These 
10 genes are taught as being useful generally in all cell types, being involved in the (generic) hypoxic 
response. 

Significantly, the present invention extends this work, and, indeed, defines a significant advance over 
similar work that has been performed on the genetic mechanisms that act in response to other 
physiological or genetic stimuli. The present inventors, using the novel methods disclosed herein, have 
15 discovered that far from being generic, the cellular response to many physiological conditions differs 
. markedly between different cell types. The cellular response that has been studied in order to illustrate 
this finding is the response to hypoxia. From these results, it has been inferred herein, quite reasonably, 
that far from being generic, cellular response mechanisms differ widely, depending on cell type. 

This discovery has far-reaching implications as regards the design of therapeutic agents that are effective 
20 to counter a disease or physiological condition. For example, an agent that is effective to prevent the 
drastic effects of hypoxia in a neurone (the effects of which include stroke) might be totally ineffective in 
countering the same effects in a cardiomyocyte (chronic ischemic heart disease). Through analysing the 
mechanism of the hypoxic response in different cell types, it may be, in contrast to the example given 
above, that a particular gene is involved in the hypoxic response in both cardiomyocytes and neurones. 
25 Were this to be the case, this would allow the design of a combined medicament, for example, a 
combined cardioprotective and neuroprotective agent. There thus remains a great need for the 
identification of proteins implicated in the physiological mechanism of hypoxia. 

According to a further aspect of the invention, there are provided genes and proteins that are identified 
using a method according to any one of the above-described aspects of the invention. Certain proteins, 
30 whose sequences are identified herein as SEQ ID Nos: 1,3, 5,7, 9, II, 13, 15, 17, 19, 21, 23,25, 27,29, 
31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 63, 67, 69, 73, 75, 77, 85, 87, 89, 91, 93, 95, 99, 
103, 1 13, 1 15, 1 19, 121, 129, 131, 133, 137, 139, 141, 145, 151, 153, 157, 159, 163, 169, 181, 187,201, 
205, 207 and 209, are functionally annotated for the first time. At present, all of these sequences are only 
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identified as "hypothetical proteins" in the public databases. Each and every one of these sequences forms 
an embodiment of this aspect of the invention. 

The invention also includes proteins whose amino acid sequences are encoded by a nucleic acid sequence 
recited in various cDNAs and ESTs deposited in the public databases, or encoded by a gene identified 
5 from such an EST. These cDNAs and. ESTs are presented herein as SEQ ID Nos: 62, 64, 66, 68, 70, 72, 
74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 92a, 94, 96, 98, 100, 102, 104, 106, 108, 1 10, 1 12, 1 14, 1 16, 1 18, 
120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 
162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 
204, 206, 208, 210, 212, 214 and 216. At present, all of these cDNA and EST sequences are functionally 
10 unannotated in the public databases. Each and every one of these sequences forms an embodiment of this 
aspect of the invention. 

One embodiment of this aspect of the invention provides substantially purified polypeptide, which 
polypeptide: 

i) comprises the amino acid sequence as recited in any one of SEQ ID Nos: 1, 3, 5, 7, 9, 1 1, 13, 
15 15,17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 63, 67, 

69,73,75,77,85,87,89,91,93,95,99,103,1 13,1 15,1 19,121,129,131,133,1 37,139,141, 
145, 151, 153, 157, 159, 163, 169, 181, 187, 201,205, 207 or 209; 

ii) has an amino acid sequence encoded by a nucleic acid sequence recited in any one of SEQ 
ID Nos: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 

20 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 92a, 94, 96, 

98, 100, 102, 104, 106, 108, 110, 1 12, 1 14, 1 16, 118, 120, 122, 124, 126, 128, 130, 132, 134, 
136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 166, 168, 170, 172, 
174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 
212, 214 and 216 or encoded by a gene identified from an EST recited in any one of these SEQ 

25 ID Nos; 

iii) is a fragment of a polypeptide according to i) or ii), provided that said fragment retains a 
biological activity possessed by the full length polypeptide of i) or ii), or has an antigenic 
determinant in common with the polypeptide of i) or ii); or 

iv) is a functional equivalent of a polypeptide of i), ii) or (iii). 

30 The polypeptide sequences recited in SEQ ID Nos: 1, 3, 5, 7, 9, 1 1,13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 
33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 63, 67, 69, 73, 75, 7-7, 85, 87, 89, 91, 93, 95, 99, 103, 
1 13, 1 15, 119, 121, 129, 131, 133, 137, 139, 141, 145, 151, 153, 157, 159, 163, 169, 181, 187,201,205, 
207 and 209 were, prior to the present disclosure, unannotated in the literature and public sequence 
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databases. Accordingly, until now, no biological function has been attributed to these polypeptide 
sequences; each of these sequences is generally labelled in the databases as a "hypothetical protein". The 
methods of the present invention, described above, have now elucidated a biological function for these 
polypeptides, in that they have been found to be differentially regulated under physiological conditions of 
5 hypoxia. 

These discoveries allow the development of regulators, such as small drug molecules, that affect the 
activity of these polypeptides, so allowing diseases and physiological conditions that are caused by 
hypoxia, or in which hypoxia has been implicated, to be treated. These discoveries also allow the 
development of diagnostic agents that are suitable for the detection of hypoxia in biological tissues and, 
10 through the identification of mutations and polymorphisms (such as SNPs) within genes coding for the 
proteins implicated herein, allows the assessment of an individual's risk of being susceptible to diseases 
and physiological conditions in which hypoxia is implicated. 

The biological activity of polypeptides whose sequences are listed in SEQ ID Nos: 1, 3, 5, 7, 9, 11, 13, 
15, 17, 19, 21 , 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 63, 67, 69, 73, 75, 

15 77, 85, 87, 89, 91, 93, 95, 99, 103, 113, 1 15, 1 19, 121, 129, 131, 133, 137, 139, 141, 145, 151, 153, 157, 
159, 163, 169, 181, 187, 201, 205, 207 and 209 has been found to be hypoxia-regulated. The expression 
of some of these polypeptides has been found to be induced under conditions of hypoxia, whilst the 
expression of other polypeptides has been found to be repressed. By "hypoxia-induced" is meant that the 
polypeptide is expressed at a higher level when a cell is exposed to hypoxic conditions as compared to its 

20 expression level under normoxic conditions. By "hypoxia-repressed" is meant that the polypeptide is 
expressed at a lower level when a cell is exposed to hypoxic conditions as compared to its expression 
level under normoxic conditions. 

The following polypeptides have been found to be hypoxia-induced: those polypeptides whose amino 
acid sequence is recited in SEQ ID Nos: 1, 3, 5, 7, 9, 1 1,13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 63, 67, 

25 69,73,75,77, 85,87, 89,91, 93,95,99, 103 J 13, 1 15, 1 19, 121, 129, 131, 133, 137, 139 and 141; and 
those polypeptides whose amino acid sequence is encoded by a nucleic acid sequence recited in SEQ ID 
Nos.: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 
86, 88, 90, 92, 92a, 94, 96, 98, 100, 102, 104, 106, 108, 1 10, 1 12, 1 14, 1 16, 1 18, 120, 122, 124, 126, 128, 
130, 132, 134, 136, 138, 140, 142 and 144 or is encoded by a gene identified from an EST recited in any 

30 one of these SEQ ID Nos.. 

The following polypeptides have been found to be hypoxia-repressed: those polypeptides whose amino 
acid sequence is recited in SEQ ID Nos.: 35, 37, 39, 41, 43, 45, 47, 49, 51 , 53, 55, 57, 59, 145, 151, 153, 
157, 159, 163, 169, 181, 187, 201,205,207 and 209; and those polypeptides whose amino acid sequence 
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is encoded by a nucleic acid sequence recited in SEQ ID Nos.: 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 
58, 60, 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 
184, 186, 188, 190, 192, 194, 196, 198,200,202,204,206,208,210,212,214 and 216, or encoded by a 
gene identified from an EST recited in any one of these SEQ ID Nos. 

5 For the purposes of this document, the term "hypoxia" should be taken to mean an environment of 
oxygen tension such that the oxygen content is between about 5% and 0.1 % (v/v). In most cases, hypoxic 
tissue will have an oxygen content that is less than or equal to about 2%. The term "normoxia" should be 
taken to mean conditions comprising a normal level of oxygen for the environment concerned. Normoxic 
tissue typically has an oxygen content above about 5%. 

10 The polypeptide sequences whose amino acid sequence is encoded by a nucleic acid sequence recited in 
SEQ ID Nos: 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 92a, 94, 96, 98, 100, 102, 104, 
106, 108, 1 10, 1 12, 1 14, 1 16, 1 18, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 
148, 150, 152, 154, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 
190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 212, 214 and 216, or whose amino acid sequence 

15 is encoded by a gene identified from an EST recited in any one of these SEQ ID Nos., were also, prior to 
the present disclosure, unannotated in the literature and public sequence databases, meaning that until 
now, no biological function has been attributed to these polypeptide sequences. 

The sequences in this group fall into a number of different categories. The first of these are cDNA clones, 
for which a protein sequence has not been predicted by the depositor. A second category is expressed 

20 sequence tag (EST) sequences that are represented in the UniGene database 
(http://www.ncbi.nlm.nih.gov/UniGene/), which contain modest or weak homology to known proteins 
when translated. ESTs are single-pass sequence files of the 5' region of an organism's expressed genome 
as accessed via a force cloned cDNA library. EST sequences tend to be short and as a general rule are 
error-prone. UniGene (see http://www.ncbi.nlm.nih.gov/Web/NewsItr/aug96.html for review) is an 

25 experimental system for automatically partitioning these EST sequences into a non-redundant set of gene- 
oriented clusters. Each UniGene cluster contains sequences that represent a unique gene, as well as 
related information such as the tissue types in which the gene has been expressed and map location. A 
third category of hits identified by the methods described herein is EST sequences that are contained in 
Unigene clusters, but which are not annotated and exhibit no homologies to proteins contained in the 

30 public databases. The fourth and final category encompasses singleton EST sequence entries that are not 
incorporated as entries in the Unigene database and that only appear as single entries in the public 
databases. 
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The methods of the present invention, described above, have now elucidated a biological function for 
polypeptides that are encoded by genes incorporating cDNA and EST sequences that fall into the four 
categories set out above, in that these sequences have been found to be differentially regulated under 
physiological conditions of hypoxia. Such polypeptides may have an amino acid sequence that is encoded 
5 by a nucleic acid sequence recited in any one of SEQ ID Nos: 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 
84, 86, 88, 90, 92, 92a, 94, 96, 98, 100, 102, 104, 106, 108, 1 10, 1 12, 1 14, 1 16, 118, 120, 122, 124, 126, 
128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 166, 168, 
170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 
212, 214 and 216. However, the EST sequences in particular may not be part of the actual coding 

10 sequence for a gene, often representing regulatory regions of the gene, or regions that are transcribed, but 
not translated into polypeptide. Accordingly, this aspect of the invention also includes polypeptides that 
..are encoded by a gene identified from an EST recited in any one of SEQ ID Nos: 62, 64, 66, 68, 70, 72, 
74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 92a, 94, 96, 98, 100, 102, 104, 106, 108, 1 10, 1 12, 1 14, 1 16, 1 18, 
120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 

15 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 
204,206, 208, 210,212, 214 and 216; 

Polypeptides of this aspect of the invention are intended to include fragments of polypeptides according 
to i) or ii) as defined above, provided that the fragment retains a biological activity that is possessed by 
the full length polypeptide of i) or ii), or has an antigenic determinant in common with the polypeptide of 

20. i) or ii). As used herein, the term "fragment" refers to a polypeptide having an amino acid sequence that is 
the same as part, but not all, of an amino acid sequence as recited in any one of SEQ ID Nos: 1, 3, 5, 7, 9, 
1 1, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 63, 67, 69, 
73, 75, 77, 85, 87, 89, 91, 93, 95, 99, 103, 1 13, 115, 119, 121, 129, 131, 133, 137, 139, 141, 145, 151, 
153, 157, 159, 163, 169, 181, 187, 201, 205, 207 and 209, an amino acid sequence that is encoded by a 

25 nucleic acid sequence recited in any one of SEQ ID Nos. 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 
88, 90, 92, 92a, 94, 96, 98, 100, 102, 104, 106, 108, 1 1.0, 1 12, 1 14, 1 16, 118, 120, 122, 124, 126, 128, 
130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 166, 168, 170, 
172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 212, 
214 and 216, or an amino acid sequence that is encoded by a gene that is linked to a nucleic acid sequence 

30 recited in any one of these SEQ ID Nos. The fragments should comprise at least n consecutive amino 
acids from the sequence and, depending on the particular sequence, n preferably is 7 or more (for 
example, 8, 10, 12, 14, 16, 18, 20 or more). Small fragments may form an antigenic determinant. 

Such fragments may be isolated fragments, that are not part of or fused to other amino acids or 
polypeptides, or they may be comprised within a larger polypeptide, of which they form a part or region. 
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When comprised within a larger polypeptide, a fragment of the invention most preferably forms a single 
continuous region. For instance, certain preferred embodiments relate to a fragment having a pre - and/or 
pro- polypeptide region fused to the amino terminus of the fragment and/or an additional region fused to 
the carboxyl terminus of the fragment. However, several fragments may be comprised within a single 
5 larger polypeptide. 

The polypeptides of the present invention or their immunogenic fragments (comprising at least one 
antigenic determinant) can be used to generate ligands, such as polyclonal or monoclonal antibodies, that 
are immunospecific for the polypeptides. Such antibodies may be employed to isolate or to identify 
clones that express a polypeptide according to the invention or, for example, to purify the polypeptide by 
10 affinity chromatography. Such antibodies may also be employed as diagnostic or therapeutic aids, 
amongst other applications, as will be apparent to the skilled reader. 

The terra "immunospecific" means that an antibody has substantially greater affinity for a polypeptide 
according to the invention than their affinity for related polypeptides. As used herein, the term "antibody" 
is intended to include intact molecules as well as fragments thereof, such as Fab, F(ab') 2 and scFv, which 
15 are capable of binding to the antigenic determinant in question. 

The invention also includes functional equivalents of a polypeptide of i), ii) or (iii) as recited above. A 
functionally-equivalent polypeptide according to this aspect of the invention may be a polypeptides that is 
homologous to a polypeptide whose sequence is explicitly recited herein. Two polypeptides are said to be 
"homologous" if the sequence of one of the polypeptides has a high enough degree of identity or 

20 similarity to the sequence of the other polypeptide for the skilled person to determine that they are similar 
in origin and function. Preferably, homology is used to refer to sequence identity. "Identity" indicates that 
at any particular position in the aligned sequences, the amino acid residue is identical between the 
sequences. "Similarity" indicates that, at any particular position in the aligned sequences, the amino acid 
residue is of a similar type between the sequences. Degrees of identity and similarity can be readily 

25 calculated according to methods'known in the art (see, for example, Computational Molecular Biology, 
Lesk, A.M., ed., Oxford University Press, New York, 1988; Biocomputing. Informatics and Genome 
Projects, Smith, D.W ., ed., Academic Press, New York, 1993). Advantageously, the BLAST algorithm is 
employed, with parameters set to default values. The BLAST algorithm is described in detail at 
http://www.ncbi.nih.gov/BLAST/blast_help.html, which is incorporated herein by reference. The search 

30 parameters are defined as follows, and are advantageously set to the defined default parameters. 

Advantageously, "substantial homology 0 when assessed by BLAST equates to sequences which match 
with an EXPECT value of at least about 7, preferably at least about 9 and most preferably 10 or more. 
The default threshold for EXPECT in BLAST searching is usually 10. 
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BLAST (Basic Local Alignment Search Tool) is the heuristic search algorithm employed by the programs 
blastp, blastn, blastx, tblastn, and tblastx; these programs ascribe significance to their findings using the 
statistical methods of Karlin and Altschul (see http://www.ncbi. nih.gov/BLAST/blast_help : hlnil) with a 
few enhancements. The BLAST programs were tailored for sequence similarity searching, for example to 
5 identify homologues to a query sequence. The programs are not generally useful for motif-style 
searching. For a discussion of basic issues in similarity searching of sequence databases, see Altschul et 
al. (1994) Nature Genetics 6:119-129. 

The five BLAST programs available at http://www.ncbi.nlm.nih.gov perform the following tasks: 

blastp compares an amino acid query sequence against a protein sequence database; 

10 blastn compares a nucleotide query sequence against a nucleotide sequence database; 

' blastx compares the six-frame conceptual translation products of a nucleotide query sequence (both 
strands) against a protein sequence database; 

tblastn compares a protein query sequence against a nucleotide sequence database dynamically translated 
in all six reading frames (both strands). 

15 tblastx compares the six-frame translations of a nucleotide query sequence against the six-frame 
translations of a nucleotide sequence database. 

BLAST uses the following search parameters: 

HISTOGRAM Display a histogram of scores for each search; default is yes. (See parameter H in the 
BLAST Manual). 

20 DESCRIPTIONS Restricts the number of short descriptions of matching sequences reported to the 
number specified; default limit is 100 descriptions. (See parameter V in the manual page). See also 
EXPECT and CUTOFF. 

ALIGNMENTS Restricts database sequences to the number specified for which high-scoring segment 
pairs (HSPs) are reported; the default limit is 50. If more database sequences than this happen to satisfy 
25 the statistical significance threshold for reporting (see EXPECT and CUTOFF below), only the matches 
ascribed the greatest statistical significance are reported. (See parameter B in the BLAST Manual). 

EXPECT The statistical significance threshold for reporting matches against database sequences; the 
default value is 10, such that 10 matches are expected to be found merely by chance, according to the 
stochastic model of Karlin and Altschul (1990). If the statistical significance ascribed to a match is greater 
30 than the EXPECT threshold, the match will not be reported! Lower EXPECT thresholds are more 
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stringent, leading to fewer chance matches being reported. Fractional values are acceptable. (See 
parameter E in the BLAST Manual). 

CUTOFF Cutoff score for reporting high-scoring segment pairs. The default value is calculated from the 
EXPECT value (see above). HSPs are reported for a database sequence only if the statistical significance 
5 ascribed to them is at least as high as would be ascribed to a lone HSP having a score equal to the 
CUTOFF value. Higher CUTOFF values are more stringent, leading to fewer chance matches being 
reported. (See parameter S in the BLAST Manual). Typically, significance thresholds can be more 
intuitively managed using EXPECT. 

MATRIX Specify an alternate scoring matrix for BLASTP, BLASTX, TBLASTN and TBLASTX. The 
10 default matrix is BLOSUM62 (Henikoff & Henikoff, 1992). The valid alternative choices include: 
PAM40, PAM120, PAM250 and IDENTITY. No alternate scoring matrices are available for BLASTN; 
specifying the MATRIX directive in BLASTN requests returns an error response. 

STRAND Restrict a TBLASTN search to just the top or bottom strand of the database sequences; or 
restrict a BLASTN, BLASTX or TBLASTX search to just reading frames on the top or bottom strand of 
15 the query sequence. 

FILTER Mask off segments of the query sequence that have low compositional complexity, as 
determined by the SEG program of Wootton & Federhen (1993) Computers and Chemistry 17:149^163, 
or segments consisting of short-periodicity internal repeats, as determined by the XNU program of 
Claverie & States (1993) Computers and Chemistry 17:191-201, or, for BLASTN, by the DUST program 
20 of Tatusov and Lipman (see http://www.ncbi.nlm.nih.gov). Filtering can eliminate statistically significant 
but biologically uninteresting reports from the blast output (e.g., hits against common acidic-, basic- or 
proline-rich regions), leaving the more biologically interesting regions of the query sequence available for 
specific matching against database sequences. 

Low complexity sequence found by a filter program is substituted using the letter n N n in nucleotide 
25 sequence (e.g., "NNNNNNNNNNNNN") and the letter n X B in protein sequences (e.g., 
"XXXXXXXXX"). 

Filtering is only applied to the query sequence (or its translation products), not to database sequences. 
Default filtering is DUST for BLASTN, SEG for other programs. 

It is not unusual for nothing at all to be masked by SEG, XNU, or both, when applied to sequences in 
30 SW1SS-PR0T, so filtering should not be expected to always yield an effect. Furthermore, in some cases, 
sequences are masked in their entirety, indicating that the statistical significance of any matches reported 
against the unfiltered query sequence should be suspect. 
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NCBl-gi Causes NCBI gi idenlifiers to be shown in the output, in addition to the accession and/or locus 
name. 

Most preferably, sequence comparisons are conducted using the simple BLAST search algorithm 
provided at http://www.ncbi.nIm.nih.gov/BLAST. 

5 Alternatively, sequence homology may be determined by algorithms such as FastA, available at 
http://biology.ncsa.uiuc.edu/BW30/BW.cgi. FastA is considered to be superior to BLAST for alignment 
of short sequences. Advantageously, the FastA algorithm is employed using default parameters at 
http://biology.ncsa.uiuc.edn/B W 30/B W .cgi. 

Typically, greater than 50% identity between two polypeptides is considered to be an indication of 
10 functional equivalence, provided that either the biological activity of the polypeptide is retained or the 
polypeptides possess an antigenic determinant in common. Preferably, a functionally equivalent 
polypeptide according to this aspect of the invention exhibits a degree of sequence identity with a 
polypeptide sequence explicitly identified herein, or with a fragment thereof, of greater than 50%. More 
preferred polypeptides have degrees of identity of greater than 60%, 70%, 80%, 90%, 95%, 98% or 99%, 
15 respectively. 

Functionally-equivalent polypeptides according to the invention are therefore intended to include natural 
biological variants (for example, allelic variants or geographical variations within the species from which 
the polypeptides are derived) and mutants (such as mutants containing amino acid substitutions, insertions 
or deletions) of the polypeptides whose sequences are explicitly recited herein. Such mutants may include 
20 polypeptides in which one or more of the amino acid residues are substituted with a conserved or non- 
conserved amino acid residue (preferably a conserved amino acid residue) and such substituted amino 
acid residue may or may not be one encoded by the genetic code. Typical such substitutions are among 
Ala, Val, Leu and He; among Ser and Thr; among the acidic residues Asp and Giu; among Asn and Gin; 
among the basic residues Lys and Arg; or among the aromatic residues Phe and Tyr. 

25 Particularly preferred are variants in which several, i.e. between 5 and 1 0, 1 and 5,1 and 3, 1 and 2 or just 
! amino acids are substituted, deleted or added in any combination. Especially preferred are silent 
substitutions, additions and deletions, which do not alter the properties and activities of the protein. Also 
especially preferred in this regard are conservative substitutions. "Mutant" polypeptides also include 
polypeptides in which one or more of the amino acid residues include a substituent group. 

30 As discussed above, using a method according to the above-described aspects of the invention it has now 
been discovered, most surprisingly, that the response to hypoxia differs between different specialised cell 
types or between different physiological states of the same cell type. For example, it has been found that 
in macrophage cells, different polypeptides are induced/repressed during different physiological states. 
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Furthermore, it has been found that a subset of this group of polypeptides are regulated only in activated 
macrophage cells. Macrophages possess various biological activities, including cytotoxic effects towards 
tumour cells and phagocytosis of bacteria or cellular debris. These form an important and potent arm of 
innate immunity, and as such must be finely regulated. In the absence of interactions with pathogens or 
5 other immune cells, the aforementioned activities of the macrophage are greatly reduced (i.e. resting 
macrophages). When given appropriate stimuli, such as contact with the lipopolysaccharide surface of 
bacteria, and/or exposure to T-cell derived interferon gamma, the functional activities of the macrophage 
are greatly potentiated (i.e. activated macrophage). 

The expression of a further subset of these polypeptides has been found herein to be induced in activated 
10 macrophages under conditions of hypoxia, whilst a still further subset has been found herein to be 
repressed in activated macrophages under conditions of hypoxia. 

In resting macrophage cells, it has been found that different polypeptides are induced/repressed during the 
biological response to hypoxia. For example, it has been found that a subset of this group of polypeptides 
are regulated only in resting macrophage cells. The expression of a further subset of these polypeptides 
15 has been found herein to be induced in resting macrophages under conditions of hypoxia, whilst a still 
further subset has been found herein to be repressed in resting macrophages under conditions of hypoxia. 

According to a further aspect of the invention, there is provided a purified and isolated nucleic acid 
molecule that encodes a polypeptide according to any one of the aspects of the invention discussed above. 
Such a nucleic acid molecule may consist of the nucleic acid sequence as recited in any one of SEQ ID 

20 Nos. 2, 4, 6, 8, 10, 1 2, 14, 16, 1 8, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 
58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 92a, 94, 96, 98, 100, 102, 104, 106, 
108, 110, 1 12, 1 14, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 
150, 152, 154, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 
192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 212, 214 and 216, or form a redundant equivalent or 

25 fragment thereof. This aspect of the invention also includes a purified nucleic acid molecule which 
hydridizes under high stringency conditions with a nucleic acid molecule as described above. 

According to a further aspect of the invention, there is provided an expression vector that contains a 
purified and isolated nucleic acid molecule according to the aspects of the invention described above. The 
invention also incorporates a delivery vehicle, such as a liposome, comprising a nucleic acid according to 
30 the above-described aspects of the invention. 

In a further aspect, the invention provides a host cell transformed with a vector of the above-described 
aspect of the invention. 
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In a still further aspect, the invention provides a ligand that binds specifically to a polypeptide according 
to the above-described aspects of the invention. The ligand may be an antagonist ligand that inhibits the 
biological activity of the polypeptide, or may be an agonist ligand that activates the hypoxia-induced 
activity of the polypeptide to augment or potentiate a hypoxia-induced activity. 

5 In a still further aspect of the invention, there is provided a ligand which binds specifically to, and which 
preferably inhibits the hypoxia-induced activity of, a polypeptide according to any one of the above- 
described aspects of the invention. Such a ligand may, for example, bean antibody that is immunospecific 
for the polypeptide in question. 

According lo a further aspect, the invention provides a polypeptide, a nucleic acid molecule, vector or 
10 ligand as described above, for use in therapy or diagnosis of a disease or abnormal physiological 
condition. Preferably, the disease or abnormal physiological condition that is affected by hypoxia; 
examples of such diseases include cancer, ischaemic conditions (such as stroke, coronary arterial disease, 
peripheral arterial disease), reperfusion injury, retinopathy, neonatal stress, preeclapmsia, atherosclerosis, 
inflammatory conditions (including rheumatoid arthritis), hair loss and wound healing. The undesired 
15 celluar process involved in said diseases might include, but is not restricted to; tumorigenesis, 
angiogenesis, apoptosis, inflammation or erythropoiesis. The undesired biochemical processes involved in 
said cellular processes might include, but is not restricted to, glycolysis, gluconeogenesis, glucose 
transportation, catecholamine synthesis, iron transport or nitric oxide synthesis. 

According to the invention, a number of known proteins have also been implicated in the biological 
20 response to hypoxia. The functions of these proteins are known, meaning that these functions have been 
annotated in the public databases. The sequences of these proteins are presented in SEQ ID Nos.: 217, 
219, 221, 223, 225, 227, 229, 231, 233, 235, 237, 239, 241, 243, 245, 247, 249, 251, 253, 255, 257, 259, 
261, 263, 265, 267, 269, 271 , 273, 275, 277, 279, 281 , 283, 285, 287, 289, 291 , 293, 295, 297, 299, 301 , 
303, 305, 307, 309, 311,313, 315, 317, 319, 321, 323, 325, 327, 329, 331, 333, 335, 337, 339, 341, 343, 
25 345, 347, 349, 351 , 353, 355, 357, 359, 361, 363, 365, 367, 369, 371 , 373, 375, 377, 379, 381, 383, 385, 
387, 389, 391, 393, 395, 397, 399, 401, 403, 405, 407, 409, 41 1,413, 415, 417, 419, 421, 423, 425, 427, 
429, 431, 433, 435, 437, 439, 441, 443, 445, 447, 449, 451, 453, 455, 457, 459, 461, 463, 465, 467, 469, 
47 1 , 473, 475, 477, 479, 481 , 483, 485 and 487 . 

According to a further aspect of the invention, there is provided a substantially purified polypeptide, 
30 which polypeptide: . 

i) comprises the amino acid sequence as recited in any one of SEQ ID Nos: 1, 3, 5, 7, 9, 
1 1, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 
57, 59, 63, 67, 69, 73, 75, 77, 85, 87, 89, 91, 93, 95, 99, 103, 113, 1 15, 1 19, 121, 129, 



WO 02/46465 



PCT/CB01/05458 



22 



10 



5 



131, 133, 137, 139, 141, 145, 151, 153, 157, 159, 163, 169, 181, 187, 201, 205, 207 or 
209 or any one of SEQ ID Nos.: 217, 219, 221, 223, 225, 227, 229, 231, 233, 235, 237, 
239, 241, 243, 245, 247, 249, 251, 253, 255, 257, 259, 261, 263, 265, 267, 269, 271, 
273, 275, 277, 279, 281, 283, 285, 287, 289, 291, 293, 295, 297, 299, 301, 303, 305, 
307, 309, 311, 313, 315, 317, 319, 321, 323, 325, 327, 329, 331, 333, 335, 337, 339, 
341, 343, 345, 347, 349, 351, 353, 355, 357, 359, 361, 363, 365, 367, 369, 371, 373, 
375, 377, 379, 381, 383, 385, 387, 389, 391, 393, 395, 397, 399, 401, 403, 405, 407, 
409, 411, 413, 415, 417, 419, 421, 423, 425, 427, 429, 431, 433, 435, 437, 439, 441, 
443, 445, 447, 449, 451, 453, 455, 457, 459, 461, 463, 465, 467, 469, 471, 473, 475, 
477, 479, 481 , 483, 485, 487, 489 and 491 ; 



15 



has an amino acid sequence encoded by a nucleic acid sequence recited in any one of 
SEQ ID Nos: 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 92a, 94, 96, 
98, 100, 102, 104, 106, 108, 1 10, 1 12, 1 14, 1 16, 1 18, 120, 122, 124, 126, 128, 130, 132, 
134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 166, 
168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 



202, 204, 206,208, 210,212, 214 and 216, or encoded by a gene identified from an EST 
recited in any one of these SEQ ID Nos; 

iii) is a fragment of a polypeptide according to i) or ii), provided that said fragment retains a 
biological activity possessed by the full length polypeptide of i) or ii), or has an 



iv) is a functional equivalent of a polypeptide of i), ii) or (iii); 

for use in the diagnosis or therapy of tumourigenesis, angiogenesis, apoptosis, the biological response to 
hypoxia conditions, or a hypoxic-associated pathology. 

The invention also provides a purified and isolated nucleic acid molecule that encodes a polypeptide 
25 according to this aspect of the invention, for use in the diagnosis or therapy of tumourigenesis, 
angiogenesis, apoptosis, the biological response to hypoxia conditions, or a hypoxic-associated pathology. 
The sequences of these molecules are provided in SEQ ID Nos.: 218, 220, 222, 224, 226, 228, 230, 232, 
234, 236, 238, 240, 242, 244, 246, 248, 250, 252, 254, 256, 258, 260, 262, 264, 266, 268, 270, 272, 274, 
276, 278, 280, 282, 284, 286, 288, 290, 292, 294, 296, 298, 300, 302, 304, 306, 308, 310, 312, 314, 316, 
30 318, 320, 322, 324, 326, 328, 330, 332, 334, 336, 338, 340, 342, 344, 346, 348, 350, 352, 354, 356, 358, 
360, 362, 364, 366, 368, 370, 372, 374, 376, 378, 380, 382, 384, 386, 388, 390, 392, 394, 396, 398, 400, 
402, 404, 406, 408, 410, 412, 414, 416, 418, 420, 422, 424, 426, 428, 430, 432, 434, 436, 438, 440, 442, 
444, 446, 448, 450, 452, 454, 456, 458, 460, 462, 464, 466, 468, 470, 472, 474, 476, 478, 480, 482, 484, 



20 



antigenic determinant in common with the polypeptide of i) or ii); or 
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486 and 488. As described above for the EST nucleic acid sequences annotated herein, this aspect of the 
invention includes redundant equivalents and fragments of the sequences explicitly recited in SEQ ID 
Nos.: 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 244, 246, 248, 250, 252, 254, 256, 
258, 260, 262, 264, 266, 268, 270, 272, 274, 276, 278, 280, 282, 284, 286, 288, 290, 292, 294, 296, 298, 
5 300, 302, 304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 324, 326, 328, 330, 332, 334, 336, 338, 340, 
342, 344, 346, 348, 350, 352, 354, 356, 358, 360, 362, 364, 366, 368, 370, 372, 374, 376, 378, 380, 382, 
384, 386, 388, 390, 392, 394, 396, 398, 400, 402, 404, 406, 408, 410, 412, 414, 416, 418, 420, 422, 424, 
426, 428, 430, 432, 434, 436, 438, 440, 442, 444, 446, 448, 450, 452, 454, 456, 458, 460, 462, 464, 466, 
468, 470, 472, 474, 4.76, 478, 480, 482, 484, 486 and 488, and purified nucleic acid molecules which 
10 hybridize under high stringency conditions with such nucleic acid molecules, and vectors containing such 
nucleic acid molecules for use in the diagnosis or therapy of tumourigenesis, angiogenesis, apoptosis, the 
biological response to hypoxia conditions, or a hypoxic-associated pathology. 

This aspect of the invention also includes ligands which bind specifically to, and which preferably inhibit 
the hypoxia-induced activity of, a polypeptide listed in SEQ ID Nos.: 217, 219, 221, 223, 225, 227, 229, 

15 231, 233, 235, 237, 239, 241, 243, 245, 247, 249, 251, 253, 255, 257, 259, 261, 263, 265, 267, 269, 271, 
273, 275, 277, 279, 281, 283, 285, 287, 289, 291, 293, 295, 297, 299, 301, 303, 305, 307, 309, 311, 313, 
315, 317, 319, 321, 323, 325, 327, 329, 331, 333, 335, 337, 339, 341, 343, 345, 347, 349, 351, 353, 355, 
357, 359, 361, 363, 365, 367, 369, 371, 373, 375, 377, 379, 381, 383, 385, 387, 389, 391, 393, 395, 397, 
399, 401, 403, 405, 407, 409, 41 1, 413, 415, 417, 419, 421, 423, 425, 427, 429, 431, 433, 435, 437, 439, 

20 441, 443, 445, 447, 449, 451, 453, 455, 457, 459, 461, 463, 465, 467, 469, 471, 473, 475, 477, 479, 481, 
483, 485 and 487, for use in the diagnosis or therapy of tumourigenesis, angiogenesis, apoptosis, the 
biological response to hypoxia conditions, or a hypoxic-associated pathology. 

The invention also provides a pharmaceutical composition suitable for modulating hypoxia and/or 
ischaemia, comprising a therapeutically-effective amount of a a polypeptide, a nucleic acid molecule, 
25 vector or ligand as described above, in conjunction with a pharmaceutically-acceptable carrier. 

The invention also provides a vaccine composition comprising a polypeptide, or a nucleic acid molecule 
as described above. 

The invention also provides a method of treating a disease in a patient in need of such treatment by 
administering to a patient a therapeutically effective amount of a polypeptide, a nucleic acid molecule, 
30 vector, ligand or pharmaceutical composition as described above. For diseases in which the expression of 
the natural gene or the activity of the polypeptide is lower in a diseased patient when compared to the 
level of expression or activity in a healthy patient, the polypeptide, nucleic acid molecule, ligand, 
compound or composition administered to the patient should be an agonist. For diseases in which the 
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expression of the natural gene or activity of the polypeptide is higher in a diseased patient when compared 
to the level of expression or activity in a healthy patient, the polypeptide, nucleic acid molecule, vector, 
ligand, compound or composition administered to the patient is an antagonist. By the term "agonist" is 
meant herein, any polypeptide, peptide, synthetic molecule or organic molecule that functions as an 
5 activator, by increasing the effective biological activity of a polypeptide, for example, by increasing gene 
expression or enzymatic activity. By the term "antagonist" is meant herein, any polypeptide, peptide, 
synthetic molecule or organic molecule that functions as an inhibitor, by decreasing the effective 
biological activity of the gene product, for example, by inhibiting gene expression of an enzyme or a 
pharmacological receptor. 

10 The invention also provides for the use of a polypeptide, nucleic acid molecule, vector, ligand or 
pharmaceutical composition according to any one of the above-described aspects of the invention in 
modifying the response of a cell to conditions of hypoxia. 

The invention also provides a polypeptide, nucleic acid molecule, vector, ligand or pharmaceutical 
composition according to any one of the above-described aspects of the invention, for use in the 
15 manufacture of a medicament for the treatment of a hypoxia-regulated condition. 

The invention also provides a method of monitoring the therapeutic treatment of disease or physiological 
condition in a patient, comprising monitoring over a period of time the level of expression or activity of 
polypeptide, nucleic acid molecule, vector or ligand in tissue from said patient, wherein altering said level 
of expression or activity over the period of time towards a control level is indicative of regression of said 
20 disease or physiological condition. 

The invention also provides a method of providing a hypoxia regulating gene, an apoptotic or an 
angiogenesis regulating gene by administering directly to a patient in need of such therapy an expressible 
vector comprising expression control sequences operably linked to one or more of the nucleic acid 
molecules as described above. 

25 The invention also provides a method of diagnosing a hypoxia-regulated condition in a patient, 
comprising assessing the level of expression of a natural gene encoding a polypeptide according to any 
one of the aspects of the invention described above in tissue from said patient and comparing said level of 
expression or activity to a control level, wherein a level that is different to said control level is indicative 
of the hypoxia-related condition. 

30 Such a method of diagnosis may be carried out in vitro. One example of a suitable method comprises the 
steps of: (a) contacting a ligand as described above with a biological sample under conditions suitable for 
the formation of a ligand-poly peptide complex; and (b) detecting said complex. 
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A further example of a suitable method may comprises the steps of: a) contacting a sample of tissue from 
the patient with a nucleic acid probe under stringent conditions that allow the formation of a hybrid 
complex between a nucleic acid molecule whose sequence is recited in any one of SEQ ID Nos.: 2, 4, 6, 
8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 
5 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 92a, 94, 96, 98, 100, 102, 104, 106, 108, 110, 
112, 114, 1 16, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 
154, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 
196, 198, 200, 202, 204, 206, 208, 210, 212, 214, 216, 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 
23S, 240, 242, 244, 246, 248, 250, 252, 254, 256, 258, 260, 262, 264, 266, 268, 270, 272, 274, 276, 278, 

10 280, 282, 284, 286, 288, 290, 292, 294, 296, 298. 300, 302, 304, 306, 308, 310, 312, 314, 316, 318, 320, 
322, 324, 326, 328, 330, 332, 334, 336. 338, 340, 342, 344, 346, 348, 350, 352, 354, 356, 358, 360, 362, 
364, 366, 368. 370, 372, 374, 376, 378, 380, 382, 384, 386, 388, 390, 392, 394, 396, 398, 400, 402, 404, 
406, 408, 410, 412, 414, 416, 418, 420, 422, 424, 426, 428, 430, 432, 434, 436, 438, 440, 442, 444, 446, 
448, 450, 452, 454, 456, 458, 460, 462, 464, 466, 468, 470, 472, 474, 476, 478, 480, 482, 484, 486 and 

15 488, and the probe; b) contacting a control sample with said probe under the same conditions used in step 
a); and c) detecting the presence of hybrid complexes in said samples; wherein detection of levels of the 
hybrid complex in the patient sample that differ from levels of the hybrid complex in the control sample 
is indicative of the hypoxia-related condition. 

A still further example of a suitable method may comprise the steps of: a) contacting a sample of nucleic 
20 acid from tissue of the patient with a nucleic acid primer under stringent conditions that allow the 
formation of a hybrid complex between a nucleic acid molecule whose sequence is recited in any one of 
SEQ ID Nos.: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 
52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 92a, 94, 96, 98, 100, 102, 
104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 
25 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 
188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 212, 214, 216, 218, 220, 222, 224, 226, 228, 
230, 232, 234, 236, 238, 240, 242, 244, 246, 248, 250, 252, 254, 256, 258, 260, 262, 264, 266, 268, 270, 
272, 274, 276, 278, 280, 282, 284, 286, 288, 290, 292, 294, 296, 298, 300, 302, 304, 306, 308, 310, 312, 
314, 316, 31 8, 320, 322, 324, 326, 328, 330, 332, 334, 336, 338, 340, 342, 344, 346, 348, 350, 352, 354, 
30 356, 358, 360, 362, 364, 366, 368, 370, 372, 374, 376, 378, 380, 382, 384, 386, 388, 390, 392, 394, 396, 
398, 400, 402, 404, 406, 408, 410, 412, 414, 416, 418, 420, 422, 424, 426, 428, 430, 432, 434, 436, 438, 
440, 442, 444, 446, 448, 450, 452, 454, 456, 458, 460, 462, 464, 466, 468, 470, 472, 474, 476, 478, 480, 
482, 484, 486 and 488, and the primer; b) contacting a control sample with said primer under the same 
conditions used in step a); c) amplifying the sampled nucleic acid; and d) detecting the level of 
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amplified nucleic acid from both patient and control samples; wherein detection of levels of the amplified 
nucleic acid in the patient sample that differ significantly from levels of the amplified nucleic acid in the 
control sample is indicative of the hypoxia-related condition. 

A still further example of a suitable method may comprised the steps of: a) obtaining a tissue sample from 
5 a patient being tested for the hypoxia-related condition; b) isolating a nucleic acid molecule according to 
any one of the above-described aspects of the invention from said tissue sample; and c) diagnosing the 
patient for the hypoxia-related condition by detecting the presence of a mutation which is associated with 
the hypoxia-related condition in the nucleic acid molecule as an indication of the hypoxia-related 
condition. This method may comprise the additional step of amplifying the nucleic acid molecule to form 
10 an amplified product and detecting the presence or absence of a mutation in the amplified product. 

Particular hypoxia-related conditions that may be diagnosed in this fashion include cancer, ischaemia, 
reperfusion, retinopathy, neonatal stress, preeclapmsia, atherosclerosis, rheumatoid arthritis, undesired 
hair loss, cardiac arrest or stroke, for example, caused by a disorder of the cerebral, coronary or peripheral 
circulation. 

15 In a further aspect, the invention provides a method for the identification of a compound that is effective 
in the treatment and/or diagnosis of a hypoxia-regulated condition, comprising contacting a polypeptide, 
nucleic acid molecule, or ligand according to any one of the above-described aspects of the invention with 
one or more compounds suspected of possessing binding affinity for said polypeptide, nucleic acid 
molecule or ligand, and selecting a compound that binds specifically to said nucleic acid molecule, 

20 polypeptide or ligand. 

According to a still further aspect of the invention, there is provided a kit useful for diagnosing a hypoxia- 
regulated condition, comprising a first container containing a nucleic acid probe that hybridises under 
stringent conditions with a nucleic acid molecule according to any one of the aspects of the invention 
described above; a second container containing primers useful for amplifying said nucleic acid molecule; 
25 and instructions for using the probe and primers for facilitating the diagnosis of the hypoxia-regulated 
condition. The kit may additionally comprise a third container holding an agent for digesting 
unhybridised RNA. 

To facilitate in the diagnosis of the hypoxia-regulated condition using one of the methods outlined above, 
in a further aspect, the invention provides an array of at least two nucleic acid molecules, wherein each of 
30 said nucleic acid molecules either corresponds to the sequence of, is complementary to the sequence of, 
or hybridises specifically to a nucleic acid molecule according to any one of the aspects of the invention 
described above. Such an array may contain nucleic acid molecules that either correspond to the sequence 
of, are complementary to the sequence of, or hybridise specifically to at least 2, 3, 4, 5, 6, 7,8,9, 10, 11, 
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12, 13, 14, 15, 16, 1 7, 1 8, 19,20,21,22,23,24,25,26,27,28,29, 30,31,32,33,34,35,36,37,38,39, 
40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 
68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81 , 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 92a, 93, 94, 
95,96, 97,98,99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 
5 117, 118, 1 19, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 
138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152,153, 154, 155, 156, 157, 158, 
159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 
180, 181, 182, 183, 184, 185, 386, 187, 188, 189, 190, 191, 192, 193, 194, 195, 196, 197, 198, 199, 200, 
201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 21 1,212, 213, 214, 215, 216, 215, 217, 218, 219, 220, 

10 221, 222, 223, 224, 225, 226, 227, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 
242, 243, 244, 245, 246, 247, 248, 249, 250, 251 , 252, 253, 254, 255, 256, 257, 258, 259, 260, 261 , 262, 
263, 264, 265, 266, 267, 268, 269, 270, 271, 272, 273, 274, 275, 276, 277, 278, 279, 280, 281, 282, 283, 
284, 285, 286, 287, 288, 289, 290, 291, 292, 293, 294, 295 or more of (he nucleic acid molecules 
implicated in a hypoxia-regulated condition as recited above. The nucleic acid molecules on the array 

15 may consist of oligonucleotides of between twelve and fifty nucleotides, more preferably, between forty 
and fifty nucleotides. Alternatively, the nucleic acid molecules on the array may consist of PCR-amplified 
cDNA inserts where the nucleic acid molecule is between 300-2000 nucleotides. 

In a related aspect, again useful for diagnosis, the invention provides an array of antibodies, comprising at 
least two different antibody species, wherein each antibody species is immunospecific with a polypeptide 
20 implicated in a hypoxia-regulated condition as described above. The invention also provides an array of 
polypeptides, comprising at least two polypeptide species as recited above, wherein each polypeptide 
species is implicated in a hypoxia-regulated condition, or is a functional equivalent variant or fragment 
thereof. 

Kits useful in the diagnostic methods of the invention may comprise such nucleic acid, antibody and/or 
25 polypeptide arrays. 

According to the invention, a kit may also comprise one or more antibodies that bind to a polypeptide as 
recited above, and a reagent useful for the detection of a binding reaction between said antibody and said 
polypeptide. 

According to a still further aspect of the invention, there is provided a genetically-modified non-human 
30 animal that has been transformed to express higher, lower or absent levels of a polypeptide according to 
any one of the aspects of the invention described above. Preferably, said genetically-modified animal is a 
transgenic or knockout animal. 
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The invention also provides a method for screening for a compound effective to treat a hypoxia-regulated 
condition, by contacting a non-human genetically-modified animal as described above with a candidate 
compound and determining the effect of the compound on the physiological state of the animal. 

As discussed in some detail above, ischaemic disease pathologies involve a decrease in the blood supply 
5 to a bodily organ, tissue or body part generally caused by constriction or obstruction of the blood vessels. 
One particular example of an ischaemic disease pathology is myocardial ischaemia, which encompasses 
several chronic and acute cardiac pathologies that involve the deprivation of the myocardium of its blood 
supply, usually through coronary artery occlusion. A key component of ischaemia is hypoxia. Following 
transient ischaemia, the affected tissue may be subjected to reperfusion and re -oxygenation, and this is of 
10 significance in its own right. 

Ischaemia/reperfusion is well known to induce cell death in myocardial tissue by apoptosis, leading to 
impaired function of the myocardium and infarction. Many of the specific molecules required to execute 
the process of apoptosis are known, but not all of these molecules have been characterised in detail. Cell 
death may also proceed by a distinct process called necrosis, which unlike apoptosis, is not initiated and 

15 controlled by specific and dedicated cellular and biochemical mechanisms (see Nicotera et al, Biochem 
Soc Symp. 1999; 66:69-73). There is substantial evidence that apoptotic cell death occurs either during or 
after myocardial ischaemia (Kajstura et al, Lab Invest. 1996; 74(1):86-I07; Cheng et al, Exp Cell Res. 
1996; 226(2):316-27; Fliss and Gattinger, Circ Res. 1996; 79(5):949-56; Veinot et al, Hum Pathol. 1997; 
28(4):485-92; Bialik et al, J Clin Invest. 1997; 1 00(6): 1 363-72 ; Gottlieb et al, J Clin Invest. 1994; 

20 94(4):l621-8; Gottlieb and Engler, Ann N Y Acad Sci. 1999; 874:412-26). In the laboratory, apoptosis is 
also induced by subjecting cardiac myocytes to hypoxia (Tanaka et al, Circ Res. 1994 Sep;75(3):426-33; 
Long et al, J Clin Invest. 1997 99(1 1): 2635-43). 

Clearly, there is a significant clinical application were a successful method to inhibit apoptosis in 
ischaemic myocardial tissue to be devised. A specific and effective treatment requires identifying 

25 biochemical target(s), which are responsible for mediating apoptosis, specifically in ischaemic myocardial 
cells. One target which plays a common role in mediating apoptosis in many cell types, namely p53, is 
not involved in apoptosis resulting from myocardial ischaemia (Bialik et al, J Clin Invest. 1997; 
1 00(6): 1363-72). Others have shown that inhibiting key mediators of apoptosis, caspases, provides 
protection against lethal reperfusion injury, following myocardial ischaemia in rat models (Mocanu el al, 

30 Br J Pharmacol. 2000; 1 30(2): 1 97-200 ; Yaoita et al, Circulation. 1998 97(3): 276-81; Holly et al, J Mol 
Cell Cardiol. 1999 31(9): 1709-15). However, this approach lacks specificity, since the caspases play a 
key role in mediating apoptosis in the majority of mammalian cell types, where it is usually beneficial. An 
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approach that involves modulating the activity of molecules shown specifically to mediate apoptosis in 
ischaemic cardiac cells, would present a distinct advantage in both specificity and efficacy. 

It has now been discovered that a polypeptide encoded by a gene identified from the EST recited in SEQ 
ID No 86, having the Protein accession number BABI510I (encoded by Homo sapiens cDNA: FLJ2I620 
5 fis, clone COL07838' Nucleotide accession AK025273) is regulated by hypoxia. Other public domain 
sequences corresponding to this gene include Homo sapiens cDNA: FLJ23265 fis, clone COL06456 
Nucleotide accession AK026918. Accordingly, when referring in the present specification to the EST 
recited in SEQ ID No 86, it is intended that these gene and protein sequences are also embraced. This 
gene was identified using Research Genetics Human GeneFilters arrays, which contain an EST 
10 corresponding to the gene (accession number R00332). In the art, the gene is now termed EGL nine 
(C.elegans) homolog 3. 

There are no reports that describe the function of this human gene. However, a high degree of amino acid 
homology is observed between the protein encoded by this gene, and a rat protein called "Growth factor 
responsive smooth muscle protein" or "SM20" (Nucleotide accession U06713; Protein accession 
15 A53770). An alignment of single letter amino acid sequences is shown below. Over the highlighted 
region there is 97% amino acid similarity and 96% amino acid identity. 
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The high degree of amino acid similarity suggests that the human protein BAB 15101 has an equivalent 
biochemical function to the rat protein A53770 ("Growth factor responsive smooth muscle protein" or 
"SM20"). Recent publications have shown that SM20 functions to promote apoptosis in neurons 
(Lipscomb et al % J Neurochem 1999; 73(l):429-32; Lipscomb et al, J Biol Chem 2000 Nov 1; [epub 
5 ahead of print]). Significantly, SM20 has been shown to be expressed at high levels in the heart (Wax et 
ai, J Biol Chem 1994; 269(17): 13041-7). 

It has also been discovered that a polypeptide encoded by a gene identified from the EST recited in SEQ 
ID No 90, having the Protein accession number CAB 8 1 622, is regulated by hypoxia. The encoding 
human gene has been annotated in the UniGene database as "Similar to rat smooth muscle protein SM- 
10 20"; the nucleotide sequence is contained within the nucleotide accession AL1 17352. More recently, a 
longer fragment of this gene has been cloned, named clorfl2, or EGLN1 (Nucleotide accession 
AAG34568; Protein accession AAG34568). Accordingly, when referring in the present specification to 
the EST recited in SEQ ID No 90, it is intended that these gene and protein sequences are also embraced. 

This distinct human gene, encoding a protein related to SM20 and EGLN3 (BAB 15101), is also induced 
15 in response to hypoxia. This gene was identified using Research Genetics Human GeneFilters arrays, 
which contain an EST corresponding to the gene (accession number H56028). 

Independently to this, a fragment of this gene has been cloned from a cDN A library derived from hypoxic 
human cardiomyoblasts, and it has been shown that the gene is increased in expression in response to 
hypoxia in this cell type (see Table I herein; penultimate row). The nucleotide sequence of this cDNA 
20 fragment is referred to herein as SEQ ID No 90a. 

In the light of this novel discovery reported herein that these human equivalents of SM20 are induced by 
hypoxia, it is herein proposed that in cardiac ischaemia, the resulting apoptosis is due at least in part, to 
increased expression of these genes. 

The therapeutic modulation of the activity of EGLN3 (BAB15101), clorfl2 (AAG34568), CAB81622, 
25 SM20 and other equivalent proteins and encoding genes therefore provides a novel means for the 
treatment of myocardial ischaemia, through the alteration of the propensity of myocardial cells to undergo 
apoptosis. For example, a suitable treatment may involve altering the susceptibility of ischaemic 
myocardial tissue to subsequent reperfusion and re-oxygenation, or may involve modulating the 
susceptibility of chronic ischaemic myocardial tissue (including forms of angina) to later more severe 
30 ischaemia, which would result in myocardial infarction. It is submitted that, by way of analogy, cerebral 
ischaemia may be treated using the same principle. 

These data provide the first connection between these related genes and the physiological response to 
hypoxia. Recently published research papers have identified that the protein products of these genes can 



WO 02/46465 



PCT/G B0 1/05458 



31 

act as proline hydroxylases (see Bruick RK et al Science. 2001 294:1337-40 and Epstein AC et al Cell. 
107:43-54). This is consistent with our observations that certain proline hydroxylases are induced in 
response to hypoxia and the genes EGLN1 and EGLN3 are part of the hypoxia response. For example, 
two genes encoding proline hydroxylases have been identified herein as being increased in expression in 
5 response to hypoxia (proline 4-hydroxylase, alpha polypeptide 1 ; SeqID: 231/232, proline 4-hydroxylase, 
alpha polypeptide II; SeqID: 349/ 350). This identified a functional significance of proline hydroxylation 
as a response to hypoxia. A preferred embodiment of the invention thus includes methods for modulating 
the biological response to hypoxia by modulating the proline hydroxylase activity of the EGLN3 
(BAB15101),clorfl2 (AAG34568), CAB81622 and SM20 proteins. 

10 Furthermore, a number of bacteria, such as moraxella, are thought to be involved in the initiation of 
inflammatory diseases. Many bacteria contain, within their genome, genes encoding proteins that share 
homology to the EGLN family of prolyl hydroxylases. We therefore propose that these bacterial genes 
may initiate a hypoxic like response at the site of infection thereby causing localised inflammation. The 
resulting inflammatory infiltrate could then cause the tissue to become hypoxic thereby continuing the 

15 cycle of hypoxia response. 

As discussed in detail above, fragments and functional equivalents of the EGLN3 (BAB 1 5 101 ), cl orfl 2 
(AAG34568), CAB81622, SM20 and other equivalent proteins are included within the present invention, 
in addition to ligands that bind specifically to these proteins. Furthermore, the invention also embraces 
purified and isolated nucleic acid molecules encoding these proteins, fragments and functional 
20 equivalents, vectors containing such nucleic acid molecules and host cells transformed with these vectors. 

The therapeutic and diagnostic applications discussed above are also equally relevant to this aspect of the 
invention. For example, small molecule inhibitors of the EGLN3 (BAB 1 5 1 0 1 ), clorf 1 2 (AAG34568), 
CAB81622, SM20 and equivalent proteins and encoding genes are envisaged for utility as pharmaceutical 
agents, particularly in modulating the proline hydroxylase activity of the EGLN3 and c J orfl 2 proteins. 

25 Truncated or chimeric inhibitory derivatives of the encoding genes, or distinct genes that encode 
regulators of the BAB 15101 , A AG34568, CAB81622 and SM20 encoding genes, are also envisaged for 
utility for gene therapy. 

An alignment of the amino acid sequences of rat SM20 (Accession A53770), its human equivalent 
(Accession BAB 15101 ; SEQ ID No: 85) and this distinct human homologue (Accession CAB81622 or 
30 AAG34S68;SEQIDNo:89)isshownbelow: 

1 50 

BAB15101 (1) 

A53770 (1) 

35 AAG34568 (1) MANDSGGPGGPSPSERDRQYCELCGKMENLLRCSRCRSSFYCCKEHQRQD 
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From this sequence alignment, a highly conserved region of amino acid sequence may be noted, the 
consensus of which is as follows: 

KAMVACYPGNGTGYVRHVDNPNGDGRCITCIY^ 
WSDRRNPHEVQ PSYATRYAMTVWYFDAEERAEAKKK 
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This consensus sequence, and variants thereof, may be used in the identification of other proteins that are 
implicated in the biological response to hypoxia. This aspect of the invention therefore provides a 
substantially purified polypeptide comprising the consensus sequence: 

5 KAMVACYPGNGTGYVRHVDNPNG 

WSDRRNPHEVQPSYATRYAl^m/WYFDAEERAEAKKK, or a variant thereof. 

The invention also provides a substantially purified polypeptide comprising the consensus sequence: 
KAMVACYPGNGTGYVRKVDN 
10 WSDRRNPHEVQPSYATRYAMTVWYFDAEERAEAKKK , or a variant thereof, in the treatment or diagnosis 
of a hypoxia-related disease or condition. 

Neither this consensus domain nor any proteins that contain this domain have been previously associated 
with the cellular response to hypoxia/ischaemia. Searches of the public databases indicate that the human 
genome contains several genes that encode proteins that contain this consensus sequence. These proteins 
15 may have similar functions or may function in the same biochemical pathway, potentially with an 
antagonistic effect. 

By "variant" is meant a variation of the consensus sequence given above, that exhibits a degree of 
homology with the consensus sequence above a certain threshold level of identity or similarity. Degrees 
of identity and similarity can be readily calculated according to methods known in the art (see, for 

20 example, Computational Molecular Biology, Lesk, A.M., ed., Oxford University Press, New York, 1988; 
Biocomputing. Informatics and Genome Projects, Smith, D.W., ed., Academic Press, New York, 1993). 
Typically, greater than 50% identity between two sequences is considered to be an indication of 
functional equivalence. Preferably, a variant consensus according to this aspect of the invention exhibits a 
degree of sequence identity with the consensus sequence given above, of greater than 50%. More 

25 preferred polypeptides have degrees of identity of greater than 60%, 70%, 80%, 90%, 95%, 98% or 99%, 
respectively. 

As discussed in detail above, fragments and functional equivalents of these proteins are included within 
the present invention, in addition to ligands that bind specifically to these proteins. Furthermore, the 
invention also embraces purified and isolated nucleic acid molecules encoding these proteins, fragments 
30 and functional equivalents, vectors containing such nucleic acid molecules and host cells transformed 
with these vectors. The therapeutic and diagnostic applications discussed above are also equally relevant 
to this aspect of the invention. 
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The polypeptide referred to above as that encoded by SEQ ID No 91 is a specific protein that is termed 
"Semaphorin 4b\ The gene encoding this protein is regulated (activated) by conditions of hypoxia. The 
Semaphorin 4b protein is encoded by a gene identified from the EST recited in SEQ ID No 92. The 
unequivocal and accurate full length cDNA sequence is provided herein as SEQ ID No 92a. The accurate 
5 presumptive amino acid sequence is provided herein as SEQ ID No 91. This protein, functionally- 
equivalent variants of this protein, the encoding nucleic acid molecules and ligands that regulate the 
activity and/or expression of this gene and protein are claimed above in the context of their role in 
hypoxia and hypoxia-related disorders. 

Semaphores are a large family of proteins, characterised by the 500 amino acid sema domain (Puschel et 
10 al., 1995, Neuron, 14(5): 941-8; Tamagnone and Comoglio, 2000, Trends Cell Biol., 10(9): 377-83). 
Early work showed a role in the guidance of axons during brain development, and the regulation of cell 
migration. More recently, specific members of this large family have been associated with cancer 
(Brambilla et al., Am J Pathol., 2000, 156(3): 939-50), rheumatoid arthritis (Mangasser-Stephan et al., 
Biochem Biophys Res Commun., 1997, 234(1): 153-6), the immune system (Spriggs, Curr Opin 
15 Immunol., 1999, 1 1(4): 387-91) including B-lymphocyte functions (Hall et al., Proc Natl Acad Sci USA, 
1996, 93(21 ): 1 1780-5) and angiogenesis (Miao et aL, J Cell Biol., 1999, 146(1): 233-42). This is perhaps 
not surprising considering that cell migration / trafficking is a key part of inflammation, angiogenesis and 
tumour metastasis. 

There are at least distinct 25 human semaphorin genes and the significance/ utility of many of these 
20 remains untested. This includes the Semaphorin 4b protein, which is unpublished and until now has not 
been assigned a full and accurate amino acid sequence. 

We have made experimental discoveries which link the expression of Semaphorin 4b to factors (hypoxia, 
gamma IFN and superoxide radicals) that are associated with a variety of human ischaemic and 
inflammatory diseases. In particular, a key response of cells to hypoxia is to stimulate angiogenesis, and a 

25 key part of inflammation is the recruitment and trafficking of immune cells. In light of our discoveries, 
and what is known about other specific members of the semaphorin family, it is herein proposed that 
Semaphorin 4b is a regulator of these cellular functions, and thus provides a novel target for therapeutic 
intervention. This paves the way for the development of therapeutic agents that either potentiate or 
antagonise functions of Semaphorin 4b. Such agents are likely to be highly valuable in the treatment of 

30 human disease. 

The practice of the present invention will employ, unless otherwise indicated, conventional techniques of 
molecular biology, microbiology, recombinant DNA technology and immunology, which are within the 
skill of those working in the art. 
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Mosl genera] molecular biology, microbiology recombinant DNA technology and immunological 
techniques can be found in Sambrook et al % Molecular Cloning, A Laboratory Manual (1989) Cold 
Harbor-Laboratory Press, Cold Spring Harbor, N.Y. or Ausubel et ai, Current protocols in molecular 
biology (1990) John Wiley and Sons, N.Y. 

5 Unless defined otherwise, all technical and scientific terms used herein have the same meaning as 
commonly understood by one of ordinary skill in the art to which this invention belongs. 

A. Polypeptides 

The term "polypeptide" as used herein, refers to a chain (may be branched or unbranched) of two or more 
amino acids linked to each other by means of a peptide bond or modified peptide bond (isosteres). The 
10 term polypeptide encompasses but is not limited to oligopeptides, peptides and proteins. The polypeptide 
of the invention may additionally be either in a mature protein form or in a pre-, pro- or prepro-protein 
form that requires subsequent cleavage for formation of the active mature protein. The pre-, pro-, prepro- 
part of the protein is often a leader or secretory sequence but may also be an additional sequence added to 
aid protein purification (for example, a His tag) or to conform a higher stability to the protein. 

15 A polypeptide according to the invention may also include modified amino acids, that is, amino acids 
other than those 20 that are gene-encoded. This modification may be a result of natural processes such as 
post-translational processing or by chemical modification. Examples of modifications include acetylation, 
acylation, amidation, ADP-ribosylation, arginylation, attachment of a lipid derivative or 
phosphatidylinositol, y-carboxylation, covalent attachment of a flavin or haeme moiety, a nucleotide or 

20 nucleotide derivative, cyclisation, demethylation, disulphide bond formation, formation of covalent cross- 
links, formylation, glycosylation, GPI anchor formation, hydroxylation, iodination, lipid attachment, 
methylation, myristoylation, oxidation, proteolytic processing, phosphorylation, prenylation, 
racemisation, selenoylation, sulphation, and ubiquitination. Modification of the polypeptide can occur 
anywhere within the molecule including the backbone, the amino acid side-chains or at the N- or C- 

25 terminals. 

A polypeptide according to the invention may either be isolated from natural sources (for example, 
purified from cell culture), or be a recombinantly produced polypeptide, or a synthetically produced 
polypeptide or a combination^ all the above. 

Antibodies 

30 A polypeptide according to the invention, its functional equivalents and/or any immunogenic fragments 
derived from the polypeptide may be used to generate ligands including immunospecific monoclonal or 
polyclonal antibodies, or antibody fragments. These antibodies can then be used to isolate or identify 
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clones expressing the polypeptide of (he invention or to purify the polypeptide by affinity 
chromatography. Further uses of these immunospecific antibodies may include, but are not limited to, 
diagnostic, therapeutic or general assay applications. Examples of assay techniques that employ 
antibodies are immunoassays, radioimmunoassays (RIA) or enzyme linked immunosorbent assay 
5 (ELJSA). In these cases, the antibodies may be labelled with an analytically-detectable reagent including 
radioisotopes, a fluorescent molecule or any reporter molecule. 

The term "immunospecific" as used herein refers to antibodies that have a substantially higher affinity for 
a polypeptide of this invention compared with other polypeptides. The term "antibody" as used herein 
refers to a molecule that is produced by animals in response to an antigen and has the particular property 

10 of interacting specifically with the antigenic determinant that induced its formation. Fragments of the 
aforementioned molecule such as Fab, F(ab') 2 and scFv, which are capable of binding the antigen 
determinant, are also included in the term "antibody". Antibodies may also be modified to make chimeric 
antibodies, where non-human variable regions are joined or fused to human constant regions (for 
example, Liu et a/., PNAS, USA, 84, 3439 (1987)). Particularly, antibodies may be modified to make 

15 them less immunogenic to an individual in a process such as humanisation (see, for example, Jones et al, 
Nature, 321, 522 (1986); Verhoeyen et ai, Science, 239, 1534 (1988); Kabat et ai, J. Immunol., 147, 
1709 (1991); Queen et a/., PNAS, USA, 86, 10029 (1989); Gorman et a/., PNAS, USA, 88, 34181 (1991) 
and Hodgson et ai y Bio/Technology, 9, 421 (1991)). The term "humanised antibody", as used herein, 
refers to antibody molecules in which the amino acids of the CDR (complementarity-determining region) 

20 and selected other regions in the variable domains of the heavy and/or light chains of a non-human donor 
antibody have been substituted with the equivalent amino acids of a human antibody. The humanised 
antibody therefore closely resembles a human antibody, but has the binding ability of the donor antibody. 
Antibodies may also have a "bispecific" nature, that is, the antibody has two different antigen binding 
domains, each domain being directed against a different epitope. 

25 Specific polyclonal antibodies may be made by immuno-challenging an animal with a polypeptide of this 
invention. Common animals used for the production of antibodies include the mouse, rat, chicken, rabbit, 
goat and horse. The polypeptide used to immuno-challenge the animal may be derived by recombinant 
DNA technology or may be chemically-synthesised. In addition, the polypeptide may be conjugated to a 
carrier protein. Commonly used carriers to which the polypeptides may be conjugated include, but are not 

30 limited to BSA (bovine serum albumin), thyroglobulin and keyhole limpet haemocyanin. Serum from the 
immuno-challenged animal is collected and treated according to known procedures, for example, by 
immunoaffinity chromatography. 

Specific monoclonal antibodies can generally be made by methods known to one skilled in the art (see for 
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example, Kohler, G. and Milstein, C, Nature 256, 495-497 (1975); Kozbor et ai, Immunology Today 4: 
72 (1983); Cole et al, 77-96 in Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc. (1985) 
and Roitt, 1. et Immunology, 25.10, Mosby-Year Book Europe Limited (1993)). Panels of monoclonal 
antibodies produced against the polypeptides of the invention can be screened for various properties, i.e., 
5 for isotype, epitope, affinity, etc. against which they are directed. Alternatively, genes encoding the 
monoclonal antibodies of interest may be isolated from hybridomas, for instance using PCR techniques 
known in the art, and cloned and expressed in appropriate vectors. 

Phage display technology may be utilised to select the genes encoding the antibodies that have exhibited 
an immunspecific response to the polypeptides of the invention (see McCafferty, J., et al, (1990), Nature 
10 348, 552-554; Marks, J. et al, (1992) Biotechnology 10,779-783). 

Ligands 

The polypeptides of the invention may also be used to search for interacting ligands. Methods for doing 
this include the screening of a library of compounds (see Coligan et al t Current Protocols in Immunology 
1(2); Chapter 5 (1991), isolating the ligands from cells, isolating the ligands from a cell-free preparation 

15 or natural product mixtures. Ligands to the polypeptide may activate (agonise) or inhibit (antagonise) its 
activity. Alternatively, compounds may affect the levels of the polypeptide present in the cell, including 
affecting gene expression, mRNA stability and the degree of post-translational modification of the 
encoded protein. The invention thus embraces methods for the identification of a compound that is 
effective in the treatment and/or diagnosis of disease, comprising contacting a polypeptide, a nucleic acid 

20 molecule or host cell according to any one of the embodiments of the invention described herein with one 
or more compounds suspected of possessing binding affinity for said polypeptide or nucleic acid 
molecule, and selecting a compound that binds specifically to said nucleic acid molecule or polypeptide, 
or that affects the level of gene expression, mRNA stability or the degree of post-translational 
modification of the encoded protein. 

25 Ligands to the polypeptide form a further aspect of the invention, as discussed in more detail above. 
Preferred "antagonist" ligands include those that bind to the polypeptide of this invention and strongly 
inhibit any activity of the polypeptide. Preferred "agonist" ligands include those that bind to the 
polypeptide and strongly induce activity of the polypeptide of this invention or increases substantially the 
level of the polypeptide in the cell. As defined above, the term "agonist" is meant to include any 

30 polypeptide, peptide, synthetic molecule or organic molecule that functions as an activator, by increasing 
the effective biological activity of a polypeptide, for example, by increasing gene expression or enzymatic 
activity. The term "antagonist" is meant to include any polypeptide, peptide, synthetic molecule or 
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organic molecule that functions as an inhibitor, by decreasing the effective biological activity of the gene 
product, for example, by inhibiting gene expression of an enzyme or a pharmacological receptor. 

Ligands to a polypeptide according to the invention may come in various forms, including natural or 
modified substrates, enzymes, receptors, small organic molecules such as small natural or synthetic 
5 organic molecules of up to 2000Da, preferably 800Da or less, peptidomimetics, inorganic molecules, 
peptides, polypeptides, antibodies, structural or functional mimetics of the aforementioned. 

B. Nucleic acid molecules 

Preferred nucleic acid molecules of the invention are those which encode the polypeptide sequences 
recited in any one of SEQ ID Nos. 1, 3, 5, 7, 9, 1 1, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 

10 41,43,45,47,49,51,53,55,57,59,63,67,69,73,75,77,85,87, 89,91,93,95,99,103, 1 13,1 15, 1 19, 
121, 129, 131, 133, 137, 139, 141, 145, 151, 153, 157, 159, 163, 169, 181, 187, 201, 205, 207 and 209. 
Examples of such nucleic acid molecules include those listed in SEQ ID Nos. 2, 4, 6, 8, 10, 12, 14, 16, 18, 
20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 
76, 78, 80, 82, 84, 86, 88, 90, 92, 92a, 94, 96, 98, 100, 102, 104, 106, 108, 1 10, 1 12, 1 14, 1 16, 1 18, 120, 

15 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162, 
164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 
206, 208, 210, 212, 214 and 216, homologous nucleic acids and nucleic acids that are complementary to 
these nucleic acid molecules. Nucleic acid molecules of this aspect of the invention may be used in 
numerous methods and applications, as described generally herein. A nucleic acid molecule preferably 

20 comprises of at least n consecutive nucleotides from any one of the sequences disclosed in SEQ ID Nos. 
2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 
60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 92a, 94, 96, 98, 100, 102, 104, 106, 108, 
110, 112, 1 14, 116, 1 18, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 
152, 154, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 

25 194, 196, 198, 200, 202, 204, 206, 208, 210, 212, 214 and 216, where n is 10 or more. A nucleic acid 
molecule of the invention also includes sequences that are complementary to the nucleic acid molecule 
described above (for example, for antisense or probing purposes). 

A nucleic acid molecule according to this aspect of the invention may be in the form of RNA, such as 
mRNA, DNA, such as cDNA, synthetic DNA or genomic DNA. The nucleic acid molecule may be 
30 double-stranded or single-stranded. The single-stranded form may be the coding (sense) strand or the non- 
coding (antisense) strand. A nucleic acid molecule may also comprise an analogue of DNA or RNA, 
including, but not limited to modifications made to the backbone of the molecule, such as, for example, a 
peptide nucleic acid (PNA). The term "PNA" as used herein, refers to an antisense molecule that 
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comprises an oligonucleotide of at least five nucleotides in length linked to a peptide backbone of amino 
acid residues, preferably ending in lysine. The terminal lysine confers solubility to the composition. PN As 
may be pegylated to extend their lifespan in a cell, where they preferentially bind complementary single- v 
stranded DNA and RNA and stop transcript elongation (Nielsen, P.E. et ai (1993) Anticancer Drug Des. 
5 8:53 63). 

A nucleic acid molecule according to this aspect of the invention can be isolated by cloning, purification 
or separation of the molecule directly from a particular organism, or from a library, such as a genomic or 
cDNA library. The molecule may also be synthesised, for example, using chemical synthetic techniques 
such as solid phase phosphoramidite chemical synthesis. RNA may be synthesized in vitro or in vivo by 
10 transcription of the relevant DNA molecule. 

Due to the degeneracy of the genetic code, differing nucleic acid sequences may encode the same 
polypeptide (or mature polypeptide). Thus, nucleic acid molecules included in this aspect of the invention 
include any molecule comprising a variant of the sequence explicitly recited. Such variants may include 
variant nucleic acid molecules that code for the same polypeptide (or mature polypeptide) as. that 

15 explicitly identified, that code for a fragment of the polypeptide, that code for a functional equivalent of 
the polypeptide or that code for a fragment of the functional equivalent of the polypeptide. Also included 
in this aspect of the invention, are variant nucleic acid molecules that are derived from nucleotide 
substitutions, deletions, rearrangements or insertions or multiple combinations of the aforementioned. 
Such molecules may be naturally occurring variants, such as allelic variants, non-naturally occurring 

20 variants such as those created by chemical mutagenesis, or variants isolated from a species, cell or 
organism type other than the type from which the sequence explicitly identified originated. Variant 
nucleic acid molecules may differ from the nucleic acid molecule explicitly recited in a coding region, 
non-coding region or both these regions. 

Nucleic acid molecules may also include additional nucleic acid sequence to that explicitly recited, for 
25 example, at the 5' or 3* end of the molecule. Such additional nucleic acids may encode for a polypeptide 
with added functionality compared with the original polypeptide whose sequence is explicitly identified 
herein. An example of this would be an addition of a sequence that is heterologous to the original nucleic 
acid sequence, to encode a fusion protein. Such a fusion protein may be of use in aiding purification 
procedures or enabling techniques to be carried out where fusion proteins are required (such as in the 
30 yeast two hybrid system). Additional sequences may also include leader or secretory sequences such as 
those coding for pro-, pre- or prepro- polypeptide sequences. These additional sequences may also 
include non-coding sequences that are transcribed but not translated including ribosome binding sites and 
termination signals. 
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A nucleic acid molecule of the invention may include molecules that are at least 70% identical over their 
entire length lo a nucleic acid molecule as explicitly identified herein in SEQ ID Nos.: 2, 4, 6, 8, 10, 12, 
14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 
70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 92a, 94, 96, 98, 100, 102, 104, 106, 108, 1 10, 1 12, 1 14, 1 16, 
5 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 
160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198,200, 
202, 204, 206, 208, 210, 212,214 and 216. Preferably, a nucleic acid molecule according to this aspect of 
the invention comprises a region that is at least 80% identical over its entire length to a nucleic acid 
molecule as explicitly identified herein in these SEQ ID Nos., preferably at least 90%, more preferably at 
10 least 95% and most preferably at least 98% or 99% identical. Further preferred embodiments include 
nucleic acid molecules that encode polypeptides that retain substantially the same biological function or 
activity as the polypeptide explicitly identified herein. The terms "homology" and "identity" should be 
given the meanings described in detail above with respect to polypeptide analysis. Preferably, nucleotide 
homology and identity are assessed using the blastn program available at http://www.ncbi.nlm.nih.gov. 

15 The nucleic acid molecules of the invention can also be engineered using methods generally.known in the 
art. These methods include but are not limited to DNA shuffling; random or non-random fragmentation 
(by restriction enzymes or shearing methods) and reassembly of fragments; insertions, deletions, 
substitutions and rearrangements 8f sequences by site-directed mutagenesis (for example, by PCR). These 
alterations may be for a number of reasons including for ease of cloning (such as introduction of new 

20 restriction sites), altering of glycosylation patterns, changing of codon preferences, splice variants 
changing the processing, and/or expression of the gene product (the polypeptide) in general or creating 
fusion proteins (see above). 

Hybridisation 

Nucleic acid molecules of the invention may also include antisense molecules that are partially 
25 complementary to a nucleic acid molecule as explicitly identified herein in SEQ ID Nos.: 2, 4, 6, 8, 10, 
12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 
68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 92a, 94, 96, 98, 100,. 102, 104, 106, 108, 110, 112, 1 14, 
1 16, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 
158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 
30 200, 202, 204, 206, 208, 210, 212, 214 and 216, and which therefore will hybridise to the encoding 
nucleic acid molecules. These antisense molecules, including oligonucleotides, can be designed to 
recognise, specifically bind to and prevent transcription of a target nucleic acid encoding a polypeptide of 
the invention, as will be known by those of ordinary skill in the art (see Cohen, J.S., Trends in Pharm. 
Sci., 10, 435 (1989), Okano, J. Neurochem. 56, 560 (1991); O'Connor, J. Neurochem 56, 560 (1991); Lee 
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et <*/., Nucleic Acids Res 6, 3073 (1979); Cooney et a/., Science 241, 456 (1988); Dervan et ai., Science 
251,1360 (1991). 

The term "hybridisation" used herein refers to any process by which a strand of nucleic acid binds with a 
complementary strand of nucleic acid by hydrogen bonding, typically forming Watson-Crick base pairs. 
5 As carried out in vitro, one of the nucleic acid populations is usually immobilised to a surface, whilst the 
other population is free. The two molecule types are then placed together, under conditions conducive to 
binding. 

The phrase "stringency of hybridisation" refers to the percentage of complementarity that is needed for 
duplex formation. "Stringency" thus refers to the conditions in a hybridization reaction that favour the 

10 association of very similar molecules over association of molecules that differ. Conditions can therefore 
exist that allow not only nucleic acid strands with 99-100% complementarity to hybridise, but sequences 
with lower complementarity (for example, 50%) to also hybridise. High stringency hybridisation 
conditions are defined herein as overnight incubation at 42°C in a solution comprising 50% formamide, 
5XSSC (!50mM NaCI, 15mM trisodium citrate), 50mM sodium phosphate (pH7.6), 5x Denhardts 

15 solution, 10% dextran sulphate, and 20 microgram/ml denatured, sheared salmon sperm DNA, followed 
by washing the filters in 0.1 X SSC at approximately 65°C. Low stringency conditions involve the 
hybridisation reaction being carried out at 35°C (see Sambrook et ai [supra]). Preferably, the conditions 
used for hybridization are those of high stringency. 

Some trans- and c/y-acting factors that may affect the binding of two complementary strands include 
20 strand length, base composition (GC pairs have an extra hydrogen bond and are thus require more energy 
to separate than AT pairs) and the chemical environment. The presence of monovalent cations (such as 
Na*) stabilises duplex formation whereas chemical denaturants such as formamide and urea destabilise 
the duplex by disruption of the hydrogen bonds. Use of compounds such as polyethylene glycol (PEG) 
can increase reassociation speeds by increasing overall DNA concentration in aqueous solution by 
25 abstracting water molecules. Denhardt's reagent or BLOTTO are chemical agents often added to block 
non-specific attachment of the liquid phase to the solid support. Increasing the temperature will also 
increase the stringency of hybridisation, as will increasing the stringency of the washing conditions 
following hybridisation (Sambrook et al [supra]). 

Numerous techniques exist for effecting hybridisation of nucleic acid molecules. Such techniques usually 
30 involve one of the nucleic acid populations being labelled. Labelling methods include, but are not limited 
to radiolabelling, fluorescence labelling, chemiluminescent or chromogenic labelling or chemically 
coupling a modified reporter molecule to a nucleotide precursor such as the biotin-streptavidin system. 
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This can be done by oligolabelling, nick-translation, end-labelling or PCR amplification using a labelled 
polynucleotide. Labelling of RNA molecules can be achieved by cloning the sequences encoding (he 
polypeptide of the invention into a vector specifically for this purpose. Such vectors are known in the art 
and may be used to synthesise RNA probes in vitro by the addition of an appropriate RNA polymerase 
5 such as T7, T3 or SP6 and labelled nucleotides. 

Various kits are commercially available that allow the labelling of molecules. Examples include those 
made by Pharmacia & Upjohn (Kalamazoo, MI); Promega (Madison WI); and the U.S. Biochemical 
Corp. (Cleveland, OH). Hybridisation assays include, but are not limited to dot-blots, Southern blotting, 
Northern blotting, chromosome in situ hybridisation (for example, FISH [fluorescence in situ 
10 hybridisation]), tissue in situ hybridisation, colony blots, plaque lifts, gridded clone hybridisation assays, 
DNA microarrays and oligonucleotide microarrays. These hybridisation methods and others, may be used 
by a skilled artisan to isolate copies of genomic DNA, cDNA, or RNA encoding homologous or 
orthologous proteins from other species. 

The invention therefore also embodies a process for detecting a nucleic acid molecule according to the 
15 invention, comprising the steps of: (a) contacting a nucleic probe with a biological sample under 
hybridising conditions to form duplexes: and (b) detecting any such duplexes that are formed. The term 
"probe" as used herein refers to a nucleic acid molecule in a hybridisation reaction whose molecular 
identity is known and is designed specifically to identify nucleic acids encoding homologous genes in 
other species. Usually, the probe population is the labelled population, but this is not always the case, as 
20 for example, in a reverse hybridisation assay. 

One example of a use of a probe is to find nucleic acid molecules with an equivalent function to those that 
are explicitly identified herein, or to identify additional family members in the same or other species. This 
can be done by probing libraries, such as genomic or cDNA libraries, derived from a source of interest, 
such as a human, a non-human animal, other eukaryote species, a plant, a prokaryotic species or a virus. 
25 The probe may be natural or artificially designed using methods recognised in the art (for example, 
Ausubel et a/., [supra]). A nucleic acid probe will preferably possess greater than 15, more preferably 
greater than 30 and most preferably greater than 50 contiguous bases complementary to a nucleic acid 
molecule explicitly identified herein. 

In many cases, isolated DNA from cDNA libraries will be incomplete in the region encoding the 
30 polypeptide, normally at the 5' end. Methods available for subsequently obtaining full-length cDNA 
sequence include RACE (rapid amplification of cDN A ends) as described by Frohman et al, (Proc. Natl. 
Acad. Sci. USA 85, 8998-9002 (1988)), and restriction-site PCR, which uses universal primers to retrieve 



WO 02/46465 



PCT/GB01/05458 



43 

unknown nucleic acid sequence adjacent to a known locus (Sarkar, G. (1993) PCR Methods Applic, 
2:318-322). "Inverse PCR" may also be used to amplify or to extend sequences using divergent primers 
based on a known region (Triglia, T. et a/., (1988) Nucleic Acids Res. 16:8186). Another method which 
may be used is "capture PCR", which involves PCR amplification of DN A fragments adjacent to a known 
5 sequence in human and yeast artificial chromosome DNA (Lagerstrom, M. et a/., (1991) PCR Methods 
Applic., 1:1 11-119). Another method which may be used to retrieve unknown sequences is that of Parker, 
J.D. et al, (1991); Nucleic Acids Res. 19:3055-3060). Additionally, one may use PCR, nested primers, 
and libraries, such as the Prom oterFinder™ library (Clontech, Palo Alto, CA) to walk genomic DNA. 
This latter process avoids the need to screen libraries and is useful in finding intron/exon junctions. 

10 When screening for full-length cDNAs, it is preferable to use libraries that have been size-selected to 
include larger cDNAs. Also, random-primed libraries are preferable, in that they will contain more 
sequences that contain the 5* regions of genes. Use of a randomly primed library may be especially 
preferable for situations in which an oligo d(T) library does not yield a full-length cDNA. Genomic 
libraries may be useful for extension of sequence into 5' non-transcribed regulatory regions. 

15 In one embodiment of the invention, a nucleic acid molecule according to the invention may be used for 
chromosome localisation. In this technique, a nucleic acid molecule is specifically targeted to, and can 
hybridise with, a particular location on an individual human chromosome. The mapping of relevant 
sequences to chromosomes is an important step in the confirmatory correlation of those sequences with 
the gene-associated disease. Once a sequence has been mapped to a precise chromosomal location, the 

20 physical position of the sequence on the chromosome can be correlated with genetic map data. Such data 
are found in, for example, McKusick, Mendelian Inheritance in Man (available on-line through Johns 
Hopkins University Welch Medical Library). The relationships between genes and diseases that have 
been mapped to the same chromosomal region are then identified through linkage analysis (coinheritance 
of physically adjacent genes). This provides valuable information to investigators searching for disease 

25 genes using positional cloning or other gene discovery techniques. Once the disease or syndrome has 
been crudely localised by genetic linkage to a particular genomic region, any sequences mapping to that 
area may represent associated or regulatory genes for further investigation. The nucleic acid molecule 
may also be used to detect differences in the chromosomal location due to translocation, inversion, etc. 
among normal, carrier, or affected individuals. 

30 Nucleic acid molecules of the present invention are also valuable for tissue localisation. Such techniques 
facilitate the determination of expression patterns of the polypeptide in tissues by detection of the mRNAs 
that encode them. These techniques include in situ hybridisation techniques and nucleotide amplification 
techniques, such as PCR. Results from these studies provide an indication of the normal functions of the 
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polypeptide in the organism, as well as highlighting the involvement of a particular gene in a disease state 
or abnormal physiological condition. 

In addition, comparative studies of the normal expression pattern of mRNAs with that of mRNAs 
encoded by a mutant gene provide valuable insights into the role of mutant polypeptides in disease. Such 
5 inappropriate expression may be of a temporal, spatial or quantitative nature. 

Vectors 

The nucleic acid molecules of the present invention may be incorporated into vectors for cloning (for 
example, pBluescript made by Stratagene) or expression purposes. Vectors containing a nucleic acid 
molecule explicitly identified herein (or a variant thereof) form another aspect of this invention. The 
10 nucleic acid molecule may be inserted into an appropriate vector by any variety of well known techniques 
such as those described in Sambrook et aL [supra]. Generally, the encoding gene can be placed under the 
control of a control element such as a promoter, ribosome binding site or operator, so that the DNA 
sequence encoding the desired polypeptide is transcribed into RN A in the transformed host cell. 

Vectors may be derived from various sources including, but not limited to bacterial plasm ids, 
15 bacteriophage, transposons, yeast episomes, insertion elements, yeast chromosomal elements, viruses for 
example, baculoviruses and SV40 (simian virus), vaccinia viruses, adenoviruses, fowl pox viruses, 
pseudorabies viruses, lentiviruses and retroviruses, or combinations thereof, such as those derived from 
plasmid and bacteriophage genetic elements, including cosmids and phagemids. Human, bacterial and 
yeast artificial chromosomes (HACs, BACs and YAGs respectively) may also be employed to deliver 
20 larger fragments of DNA than can be contained and expressed in a plasmid. 

Examples of retroviruses include but are not limited to: murine leukaemia virus (MLV), human 
immunodeficiency virus (HIV), equine infectious anaemia virus (EIAV), mouse mammary tumour virus 
(MMTV), Rous sarcoma virus (RSV), Fujinami sarcoma virus (FuSV), Moloney murine leukaemia virus 
(Mo-MLV), FBR murine osteosarcoma virus (FBR MSV), Moloney murine sarcoma virus (Mo-MSV), 
25 Abelson murine leukaemia virus (A -MLV), Avian myelocyto.matosis virus-29 (MC29), and Avian 
erythroblastosis virus (AEV). A detailed list of retroviruses may be found in Coffin et al ("Retroviruses" 
1997 Cold Spring Harbour Laboratory Press Eds: JM Coffin, SM Hughes, HE Varmus pp 758-763). 

Lentiviruses can be divided into primate and non-primate groups. Examples of primate lentiviruses 
include but are not limited to: the human immunodeficiency virus (HIV), the causative agent of human 
30 auto-immunodeficiency syndrome (AIDS), and the simian immunodeficiency virus (SIV). The non- 
primate lentiviral group includes the prototype "slow virus" visna/maedi virus (VMV), as well as the 
related caprine arthritis-encephalitis virus (CAEV), equine infectious anaemia virus (EIAV) and the more 
recently described feline immunodeficiency virus (FIV) and bovine immunodeficiency virus (BIV). 
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A distinction between the lentivirus family and other types of retroviruses is that lentiviruses have the 
capability to infect both dividing and non-dividing ceils (Lewis et at 1992 EMBO. J 1 1: 3053-3058; 
Lewis and Emerman 1994 J. Virol. 68: 510-516). In contrast, other retroviruses - such as MLV - are 
unable to infect non-dividing cells such as those that make up, for example, muscle, brain, lung and liver 
5 tissue. 

A vector may be configured as a split-intron vector. A split intron vector is described in PCT patent 
applications WO 99/15683 and WO 99/15684. 

If the features of adenoviruses are combined with the genetic stability of retroviruses/Ientiviruses then 
essentially the adenovirus can be used to transduce target cells to become transient retroviral producer 
10 cells that could stably infect neighbouring cells. Such retroviral producer cells engineered to express an 
antigen of the present invention can be implanted in organisms such as animals or humans for use in the 
treatment of angiogenesis and/or cancer. 

Poxvirus vectors are also suitable for use in accordance with the present invention. Pox viruses are 
engineered for recombinant gene expression and for the use as recombinant live vaccines. This entails the 

15 use of recombinant techniques to introduce nucleic acids encoding foreign antigens into the genome of 
the pox virus. If the nucleic acid is integrated at a site in the viral DNA which is non-essential for the life 
cycle of the virus, it is possible for the newly produced recombinant pox virus to be infectious, that is to 
say to infect foreign cells and thus to express the integrated DNA sequence. The recombinant pox virus 
prepared in this way can be used as live vaccines for the prophylaxis and/or treatment of pathologic and 

20 infectious disease. 

For vaccine delivery, preferred vectors are vaccinia virus vectors such as MVA or NYVAC. Most 
preferred is the vaccinia strain modified virus ankara (MVA) or a strain derived therefrom. Alternatives 
to vaccinia vectors include avipox vectors such as fowlpox or canarypox known as ALVAC and strains 
derived therefrom which can infect and express recombinant proteins in human cells but are unable to 
25 replicate. 

Bacterial vectors may be also used, such as salmonella, listeria and mycobacteria. 

Vectors containing the relevant nucleotide sequence may enter the host cell by a variety of methods well 
known in the art and described in many standard laboratory manuals (such as Sam brook et ai, [supra], 
Ausubel et a/., [supra], Davis et ai, Basic Methods in Molecular Biology (1986)). Methods include 
30 calcium phosphate transfection, cationic iipid-mediated transfection, DEAE-dextran mediated 
transfeclion, electroporation, microinjection, scrape loading, transduction, and ballistic introduction or 
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infection. 
Host cells 

The choice of host cells is . often dependent on the vector type used as a carrier for the nucleic acid 
molecule of the present invention. Bacteria and other microorganisms are particularly suitable hosts for 
5 plasmids, cosmids and expression vectors generally (for example, vectors derived from the pBR322 
plasmid), yeast are suitable hosts for yeast expression vectors, insect cell systems are suitable host for 
virus expression vectors (for example, baculovirus) and plant cells are suitable hosts for vectors such as 
the cauliflower mosaic virus (CaM V) and tobacco mosaic virus (TM V). Other expression systems include 
using animal cells (for example, with the LentiVectors™, Oxford BioMedica) as a host cell or even using 
10 cell-free translating systems. Some vectors, such as "shuttle vectors" may be maintained in a variety of 
host cells. An example of such a vector would be pEG 202 and other yeast two-hybrid vectors which can 
be maintained in both yeast and bacterial cells (see Ausubel et al, [supra] and Gyuris, J., Cell, 75, 791- 
803). 

Examples of suitable bacterial hosts include Streptococci, Staphylococci, Escherichia coli, Streptomyces 
15 and Bacillus subtilis cells. Yeast and fungal hosts include Saccharomyces cerevisiae and Aspergillus 
cells. Mammalian cell hosts include many immortalised cell lines available from the American Type 
Culture Collection (ATCC) such as CHO (Chinese Hamster Ovary) cells, HeLa cells, BHK (baby hamster 
kidney) cells, monkey kidney cells, C127, 3T3, BHK, HEK 293, Bowes melanoma and human 
hepatocellular carcinoma (for example, Hep G2) cells. Insect host cells that are used for baculovirus 
20 expression include Drosophila S2 and Spodoptera Sf9 cells. Plant host cells include most plants from 
which protoplasts be isolated and cultured to give whole regenerated plants. Practically, all plants can be 
regenerated from cultured cells or tissues, including but not limited to all major species of sugar cane, 
sugar beet, cotton, fruit and other trees, legumes and vegetables. 

Expression systems 

25 Also included in present invention are expression vectors that comprise a nucleic acid molecule as 
described above. Expression vectors and host cells are preferably chosen to give long term, high yield 
production and stable expression of the recombinant polypeptide and its variants. 

Expression of a polypeptide can be effected by cloning an encoding nucleic acid molecule into a suitable 
expression vector and inserting this vector into a suitable host cell. The positioning and orientation of the 
30 nucleic acid molecule insert with respect to (he regulatory sequences of the vector is important to ensure 
that the coding sequence is properly transcribed and translated. Alternatively, control and other regulatory 
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sequences may be ligated onto the nucleic acid molecule of this invention prior to its insertion into the 
expression vector. In both cases, the sequence of the nucleic acid molecule may have to be adjusted in 
order to effect correct transcription and translation (for example, addition of nucleotides may be necessary 
to obtain the correct reading frame for translation of the polypeptide from its encoding nucleic acid 
5 molecule). 

A nucleic acid molecule of the invention may comprise control sequences that encode signal peptides or 
leader sequences. These sequences may be useful in directing the translated polypeptide to a variety of 
locations within or outside the host cell, such as to the lumen of the endoplasmic reticulum, to the 
nucleus, to the periplasmic space, or into the extracellular environment. Such signals may be endogenous 
10 to the nucleic acid molecules of the invention, or may be a heterologous sequence. These leader or control 
sequences may be removed by the host during post-translational processing. 

A nucleic acid molecule of the present invention may also comprise one or more regulatory sequences 
that allow for regulation of the expression of polypeptide relative to the growth of the host cell. 
Alternatively, these regulatory signals may be due to a heterologous sequence from the vector. Stimuli 

15 that these sequences respond to include those of a physical or chemical nature such as the presence or 
absence of regulatory compounds, changing temperatures or metabolic conditions. Regulatory sequences 
as described herein, are non-translated regions of sequence such as enhancers, promoters and the 5' and 
3' untranslated regions of genes. Regulatory sequences interact with host cellular proteins that carry out 
translation and transcription. These regulatory sequences may vary in strength and specificity. Examples 

20 of regulatory sequences include those of constitutive and inducible promoters. In bacterial systems, an 
example of an inducible promoter is the hybrid lacZ promoter of the Bluescript phagemid (Stratagene, 
LaJolla, CA) or pSportlTM plasmid (Gibco BRL). The baculovirus polyhedrin promoter may be used in 
insect cells. 

An example of a preferred expression system is the lentivirus expression system, for example, as 
25 described in International patent application W 098/17815. 

Detection of uptake of vectors by the host organism 

Various methods are known in the art to detect the uptake of a nucleic acid or vector molecule by a host 
cell and/or the subsequent successful expression of the encoded polypeptide (see for example Sambrook 
etaL, [supra]). 

30 Vectors frequently have marker genes that can be easily assayed. Thus, vector uptake by a host cell can 
be readily detected by testing for the relevant phenotype. Markers include, but are not limited to those 
coding for antibiotic resistance, herbicide resistance or nutritional requirements. The gene encoding 
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dihydrofolate reductase (DHFR) for example, confers resistance to methotrexate (Wigler, M . el al (1980) 
PNAS 77:3567-70) and the gene npt confers resistance to the aminoglycosides neomycin and G-418 
(Coibere-Garapin, F. et al (1981) J. Mol. Biol. 150:1-14). Additional selectable genes have been 
described, examples of which will be clear to those of skill in the art. 

5 Markers however, only indicate that a vector has been taken up by a host cell but does not distinguish 
between vectors that contain the desired nucleic acid molecule and those that do not. One method of 
detecting for the said nucleic acid molecule is to insert the relevant sequence at a position that will disrupt 
the transcription and translation of a marker gene. These cells can then be identified by the absence of a 
marker gene phenotype. Alternatively, a marker gene can be placed in tandem with a sequence encoding a 
10 polypeptide of the invention under the control of a single promoter. Expression of the marker gene in 
response to induction or selection usually indicates expression of the tandem gene as well. 

More direct and definitive methods to detect the presence of the nucleic acid molecule of the present 
invention include DNA-DNA or DNA-RNA hybridisation with a probe comprising the relevant antisense 
molecule, as described above. More direct methods to detect polypeptide expression include protein 
15 bioassays for example, fluorescence activated cell sorting (FACS), immunoassay techniques such as 
ELISA or radioimmunoassays. 

Alternative methods for detecting or quantitating the presence of the nucleic acid molecule or polypeptide 
of this invention include membrane, solution or chip-based technologies (see Hampton, R. et al, (1990) 
Serological Methods, a Laboratory Manual, APS Press, St Paul, MN) and Maddox, D.E. et al, (1983) J. 
20 Exp. Med, 158, 121 1-1216). 

Transgenic animals 

In another embodiment of this invention, a nucleic acid molecule according to the invention may be used 
to create a transgenic animal, most commonly a rodent. The modification of the animal's genome may 
either be done locally, by modification of somatic cells or by germ line therapy to incorporate inheritable 
25 modifications. Such transgenic animals may be particularly useful in the generation of animal models for 
drug molecules effective as modulators of the polypeptides of the present invention. 

Polypeptide purification 

A polypeptide according to the invention may be recovered and purified from recombinant cell cultures 
by methods including, but not limited to cell lysis techniques, ammonium sulphate precipitation, ethanol 
30 precipitation, acid extraction, anion or cation chromatography, phosphocellulose chromatography, 
hydrophobic interaction chromatography, affinity chromatography, hydroxylapatite chromatography and 
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lectin chromatography, high performance liquid chromatography (HPLC) or fast performance liquid 
chromatography (FPLC). The polypeptide may need refolding after purification or isolation and many 
well known techniques are available that will help regenerate an active polypeptide conformation. 

Many expression vectors are commercially available that aid purification of the relevant polypeptide. 
5 These include vectors that join the sequence encoding the polypeptide to another expressed sequence 
creating a fused protein that is easier to purify. Ways in which these fused parts can facilitate purification 
of the polypeptide of this invention include fusions that can increase the solubility of the polypeptide, 
joining of metal chelating peptides (for example, histidine-tryptophan modules) that allow for purification 
with immobilised metals, joining of protein A domains which allow for purification with immobilised 
10 immunoglobulins and the joining of the domain that is utilised in the FLAGS extension/affinity 
purification system (Immunex Corp., Seattle, WA). Fusion of the polypeptide of this present invention 
with a secretion signal polypeptide may also aid purification. This is because the medium into which the 
fused polypeptide has been secreted can subsequently be used to recover and purify the expressed 
polypeptide. 

15 If necessary, these extraneous polypeptides often comprise a cleavable linker sequence which allows the 
polypeptide to be isolated from the fusion. Cleavable linker sequences between the purification domain 
and the polypeptide of the invention include those specific for Factor Xa or for enterokinase (Invitrogen, 
San Diego, CA). One such expression vector provides for expression of a fusion protein containing the 
polypeptide of the invention fused to several histidine residues preceding a thioredoxin or an enterokinase 

20 cleavage site. The histidine residues facilitate purification by IMAC (immobilised metal ion affinity 
chromatography as described in Porath, J. et al. (1992), Prot. Exp. Purif. 3: 263-281), while the 
thioredoxin or enterokinase cleavage site provides a means for purifying the polypeptide from the fusion 
protein. A discussion of vectors that contain fusion proteins is provided in Kroll, D.J. et al (1993; DNA 
Cell Biol. 12:441-453). 

25 Assays 

Another aspect of this invention includes assays that may be carried out using a polypeptide or nucleic 
acid molecule according to the invention. Such assays may be for many uses including the development 
of drug candidates, for diagnostic purposes or for the gathering of information for therapeutics. 

If the polypeptide is to be expressed for use in screening assays, generally it is preferred that it be 
30 produced at the surface of the host cell in which it is expressed. In this event, the host cells may be 
harvested prior to use in the screening assay, for example using techniques such as fluorescence activated 
cell sorting (FACS) or immunoaffinity techniques. If the polypeptide is secreted into the medium, the 



WO 02/46465 



PCT/G B0 1/05458 



50 

medium can be recovered in order to recover and purify the expressed polypeptide. If polypeptide is 
produced intracellularly, the cells must first be lysed before the polypeptide is recovered. 

The polypeptide of the invention can be used to screen libraries of compounds in any of a variety of drug 
screening techniques. Such compounds may activate (agonise) or inhibit (antagonise) the level of 
5 expression of the gene or the activity of the polypeptide of the invention and form a further aspect of the 
present invention. Examples of suitable compounds are those which are effective to alter the expression 
of a natural gene which encodes a polypeptide of the invention or to regulate the activity of a polypeptide 
of the invention. 

Agonist or antagonist compounds may be isolated from, for example, cells, cell-free preparations, 
10 chemical libraries or natural product mixtures. These agonists or antagonists may be natural or modified 
substrates, ligands, enzymes, receptors or structural or functional mimetics. For a suitable review of such 
screening techniques, see Coligan et al., Current Protocols in Immunology l(2):Chapter 5 (1991). 

Potential agonists or antagonists include small organic molecules, peptides, polypeptides and antibodies 
that bind to the polypeptide of the invention and thereby modulate its activity. In this fashion, binding of 
15 the polypeptide to normal cellular binding molecules may be potentiated or inhibited, such that the 
normal biological activity of the polypeptide is enhanced or prevented. 

The polypeptide of the invention that is employed in such a screening technique may be free in solution, 
affixed to a solid support, borne on a cell surface or located intracellularly. In general, such screening 
procedures may involve using appropriate cells or cell membranes that express the polypeptide that are 

20 contacted with a test compound to observe binding, or stimulation or inhibition of a functional response. 
The functional response of the cells contacted with the test compound is then compared with control cells 
that were not contacted with the test compound. Such an assay may assess whether the test compound 
results in a signal generated by activation of the polypeptide, using an appropriate detection system. 
Inhibitors of activation are generally assayed in the presence of a known agonist and the effect on 

25 activation by the agonist in the presence of the test compound is observed. 

Alternatively, simple binding assays may be used, in which the adherence of a test compound to a surface 
bearing the polypeptide is detected by means of a label directly or indirectly associated with the test 
compound or in an assay involving competition with a labelled competitor. In another embodiment, 
competitive drug screening assays may be used, in which neutralising antibodies that are capable of 
30 binding the polypeptide specifically compete with a test compound for binding. In this manner, the 
antibodies can be used to detect the presence of any test compound that possesses specific binding affinity 
for the polypeptide. 
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Assays may also be designed to detect the effect of added test compounds on the production of mRNA 
encoding the polypeptide in cells. For example, an ELISA may be constructed that measures secreted or 
• cell-associated levels of polypeptide using monoclonal or polyclonal antibodies by standard methods 
known in the art, and this can be used to search for compounds that may inhibit or enhance the production 
5 of the polypeptide from suitably manipulated cells or tissues. The formation of binding complexes 
between the polypeptide and the compound being tested may then be measured. 

Another technique for drug screening which may be used provides for high throughput screening of 
compounds having suitable binding affinity to the polypeptide of interest (see International patent 
application WO84/03564). In this method, large numbers of different small test compounds are 
10 synlhesised on a solid substrate, which may then be reacted with the polypeptide of the invention and 
washed. One way of immobilising the polypeptide is to use non-neutralising antibodies. Bound 
polypeptide may then be detected using methods that are well known in the art. Purified polypeptide can 
also be coated directly onto plates for use in the aforementioned drug screening techniques. 

A polypeptide according to the invention may be used to identify membrane-bound or soluble receptors, 
15 through standard receptor binding techniques that are known in the art, such as ligand binding and 
crosslinking assays in which the polypeptide is labelled with a radioactive isotope, is chemically 
modified, or is fused to a peptide sequence that facilitates its detection or purification, and incubated with 
a source of the putative receptor (for example, a composition of cells, cell membranes, cell supernatants, 
tissue extracts, or bodily fluids). The efficacy of binding may be measured using biophysical techniques 
20 such as surface plasmon resonance and spectroscopy. Binding assays may be used for the purification and 
cloning of the receptor, but may also identify agonists and antagonists of the polypeptide, that compete 
with the binding of the polypeptide to its receptor. Standard methods for conducting screening assays are 
well understood in the art. 

A typical polypeptide-based assay might involve contacting the appropriate cell(s) or cell membrane(s) 
25 expressing the polypeptide with a test compound. In such assays, a polypeptide according to the invention 
may be free in solution, affixed to a solid support, borne on a cell surface or located intracellular^. Any 
response to the test compound, for example a binding response, a stimulation or inhibition of a functional 
response may then be compared with a control where the celi(s) or cell membrane(s) was/were not 
contacted with the test compound. 

30 A binding response could be measured by testing for the adherence of a test compound to a surface 
bearing a polypeptide according to the invention. The test compound may aid polypeptide detection by 
being labelled, either directly or indirectly. Alternatively, the polypeptide itself may be labelled, for 
example, with a radioisotope, by chemical modification or as a fusion with a peptide or polypeptide 
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sequence that will facilitate polypeptide detection. Alternatively, a binding response may be measured, 
for example, by performing a competition assay with a labelled competitor or vice versa. One example of 
such a technique is a competitive drug screening assay, where neutralising antibodies that are capable of 
specifically binding to the polypeptide compete with a test compound for binding. In this manner, the 
5 antibodies may be used to detect the presence of any test compound that possesses specific binding 
affinity for the polypeptide. Alternative binding assay methods are well known in the art and include, but 
are not limited to, cross-linking assays and filter binding assays. The efficacy of binding may be 
measured using biophysical techniques including surface plasmon resonance and spectroscopy. 

High throughput screening is a type of assay which enables a large number of compounds to be searched 
10 for any significant binding activity to the polypeptide of interest (see patent application WO84/03564). 
This is particularly useful in drug screening. In this scenario, many different small test compounds are 
synthesised on to a solid substrate. The polypeptide is then introduced to this substrate and the whole 
apparatus washed. The polypeptide is then immobilised by, for example, using non-neutralising 
antibodies. Bound polypeptide may then be detected using methods that are well known in the art. 
15 Purified polypeptide may also be coated directly onto plates for use in the aforementioned drug screening 
techniques. 

Assay methods that are also included within the terms of the present invention are those that involve the 
use of the genes and polypeptides of the invention in overexpression or ablation assays. Such assays 
involve the manipulation of levels of these genes/polypeptides in cells and assessment of the impact of 
20 this manipulation event on the physiology of the manipulated cells. For example, such experiments reveal 
details of signaling and metabolic pathways in which the particular genes/polypeptides are implicated, 
generate information regarding the identities of polypeptides with which the studied polypeptides interact 
and provide clues as to methods by which related genes and proteins are regulated. 

Another aspect of this invention provides for any screening kits that are based or developed from any of 
25 the aforementioned assays. 

C. Pharmaceuticals 

A further aspect of the invention provides a pharmaceutical composition suitable for modulating hypoxia 
and/or ischaemia, comprising a therapeutically-effective amount of a polypeptide, a nucleic acid 
molecule, vector or ligand as described above, in conjunction with a pharmaceuticaiiy-acceptable carrier. 
30 A composition containing a polypeptide, nucleic acid molecule, ligand or any other compound of this 
present invention (herein known as X) is considered to be "substantially free of impurities" (herein known 
as Y) when X makes up more than 85% mass per mass of the total [X+Y] mass. Preferably X comprises 
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at least 90% of the total X-f Y mass. More preferably X comprises at least 95%, 98% and most preferably 
99% of the total X+Y mass. 

Carriers 

Carrier molecules may be genes, polypeptides, antibodies, liposomes or indeed any other agent provided 
5 that the carrier does not itself induce toxicity effects or cause the production of antibodies that are harmful 
to the individual receiving the pharmaceutical composition. Further examples of known carriers include 
polysaccharides, polylactic acids, polyglycolic acids and inactive virus particles. Carriers may also 
include pharmaceutical^ acceptable salts such as mineral acid salts (for example, hydrochlorides-, 
hydrobromides, phosphates, sulphates) or the salts of organic acids (for example, acetates, propionates, 

10 malonates, benzoates). Pharmaceutical^ acceptable carriers may additionally contain liquids such as 
water, saline, glycerol, ethanol or auxiliary substances such as wetting or emulsifying agents, pH 
buffering substances and the like. Carriers may enable the pharmaceutical compositions to be formulated 
into tablets, pills, dragees, capsules, liquids, gels, syrups, slurries, suspensions to aid intake by the patient. 
A thorough discussion of pharmaceutical^ acceptable carriers is available in Remington's Pharmaceutical 

15 Sciences (Mack Pub. Co., N.J. 1991). 

Dosage 

The amount of component X in the composition should also be in therapeutically effective amounts. The 
phrase "therapeutically effective amounts" used herein refers to the amount of agent needed to treat, 
ameliorate, or prevent (for example, when used as a vaccine) a targeted disease or condition. An effective 

20 initial method to determine a "therapeutically effective amount" may be by carrying out cell culture 
assays (for example, using neoplastic cells) or using animal models (for example, mice, rabbits, dogs or 
pigs). In addition to determining the appropriate concentration range for X to be therapeutically effective, 
animal models may also yield other relevant information such as preferable routes of administration that 
will give maximum effectiveness. Such information may be useful as a basis for patient administration. A 

25 "patient" as used in herein refers to the subject who is receiving treatment by administration of X. 
Preferably, the patient is human, but the term may also include animals. 

The therapeutically-effective dosage will generally be dependent on the patient's status at the time of 
adminstration. Factors that may be taken into consideration when determining dosage include the severity 
of the disease state in the patient, the general health of the patient, the age, weight, gender, diet, time and 
30 frequency of administration, drug combinations, reaction sensitivities and the patient's tolerance or 
response to the therapy. The precise amount can be determined by routine experimentation but may 
ultimately lie with the judgement of the clinician. Generally, an effective dose will be from 0.01 mg/kg 
(mass of drug compared to mass of patient) to 50 mg/kg, preferably 0.05 mg/kg to 10 mg/kg. 
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Compositions may be administered individually to a patient or may be administered in combination with 
other agents, drugs or hormones. 

Routes of administration 

Uptake of a pharmaceutical composition of the invention by a patient may be initiated by a variety of 
5 methods including, but not limited to enteral, intra-arterial, intrathecal, intramedullary, intramuscular, 
intranasal, intraperitoneal, intravaginal, intravenous, intraventricular, oral, rectal (for example, in the form 
of suppositories), subcutaneous, sublingual, transcutaneous applications (for example, see W 098/20734) 
or transdermal means. 

Gene guns or hyposprays may also be used to administer the pharmaceutical compositions of the 
10 invention. Typically, the therapeutic compositions may be prepared as injectables, either as liquid 
solutions or suspensions; solid forms suitable for solution in, or suspension in, liquid vehicles prior to 
injection may also be prepared. Direct delivery of the compositions can generally be accomplished by 
injection, subcutaneonsly, intraperitoneally, intravenously or intramuscularly, or delivered to the 
interstitial space of a tissue. The compositions can also be administered into a lesion. Dosage treatment 
15 may be a single dose schedule or a multiple dose schedule. 

Inhibition of excessive activity 

If a particular disease state is partially or completely caused by an inappropriate excess in the activity of a 
polypeptide according to the invention, several approaches are available for inhibiting this activity. 
One approach comprises administering to a patient an inhibitor compound (antagonist) along with a 
20 pharmaceutical^ acceptable carrier in an amount effective to inhibit the function of the polypeptide, such 
as by blocking the binding of a ligand, substrate, enzyme, receptor, or by inhibiting a second signal, and 
thereby alleviating the abnormal condition. Such an antagonist molecule may, for example, be an 
antibody. Most preferably, such antibodies are chimeric and/or humanised to minimise their 
immunogenicity, as previously described. 

25 In another approach, soluble forms of the polypeptide that retain binding affinity for the ligand, substrate, 
enzyme, receptor, in qoestion, may be administered to the patient to compete with the biological activity 
of the endogenous polypeptide. Typically, the polypeptide may be administered in the form of a fragment 
that retains a portion that is relevant for the desired biological activity. 

In an alternative approach, expression of the gene encoding the polypeptide can be inhibited using 
30 expression blocking techniques, such as by using antisense nucleic acid molecules (as described above), 
either internally generated or separately administered. Modifications of gene expression may be effected 
by designing complementary sequences or antisense molecules (DNA, RNA, or PNA) to the control, 5' or 
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regulatory regions (signal sequence, promoters, enhancers and introns) of the gene encoding the 
polypeptide. Similarly, inhibition can be achieved using "triple helix" base-pairing methodology. Triple 
helix pairing is useful because it causes inhibition of the ability of the double helix to open sufficiently for 
the binding of polymerases, transcription factors, or regulatory molecules. Recent therapeutic advances 
5 using triplex DNA have been described in the literature (Gee, J.E. et al (1994) In: Huber, B.E. and B.l. 
Carr, Molecular and Immunologic Approaches, Futura Publishing Co., Mt. Kisco, NY). The 
complementary sequence or antisense molecule may also be designed to block translation of mRNA by 
preventing the transcript from binding to ribosomes. Such oligonucleotides may be administered or may 
be generated in situ from expression in vivo. 

10 Gene silencing approaches may also be undertaken to down-regulate endogenous expression of a gene. 
RNA interference (RNAi) (Elbashir, SM et al., Nature 2001, 411, 494-498) is one method of sequence 
specific post-transcriptional gene silencing that may be employed. Short dsRNA oligonucleotides are 
synthesised in vitro and introduced into a cell. The sequence specific binding of these dsRNA 
oligonucleotides triggers the degradation of target mRNA, reducing or ablating target protein expression. 

15 In addition, expression of a polypeptide according to the invention may be prevented by using a ribozyme 
specific to the encoding mRNA sequence for the polypeptide. Ribozymes are catalytically active RNAs 
that can be natural or synthetic (see for example Usman, N, et ai t Curr. Opin. Struct. Biol (1996) 6(4), 
527-33). Synthetic ribozymes can be designed to specifically cleave mRNAs at selected positions thereby 
preventing translation of the mRNAs into functional polypeptide. Ribozymes may be synthesised with a 

20 natural ribose phosphate backbone and natural bases, as normally found in RNA molecules. Alternatively 
the ribozymes may be synthesised with non-natural backbones, for example, 2' 0-methyl RNA, to 
provide protection from ribonuclease degradation and may contain modified bases. 

Efficacy of the gene silencing approaches assessed above may be assessed through the measurement of 
polypeptide expression (for example, by Western blotting), and at the RNA level using TaqMan-based 
25 methodologies. 

RNA molecules may be modified to increase their intracellular stability and half-life. Possible 
modifications include, but are not limited to, the addition of flanking sequences at the 5' and/or 3' ends of 
the molecule or the use of phosphorothioate or V O-methyl rather than phosphodiesterase linkages within 
the backbone of the molecule. This concept is inherent in the production of PNAs and can be extended in 
30 all of these molecules by the inclusion of non-traditional bases such as inosine, queosine and butosine, as 
well as acetyl-, methyl-, thio- and similarly modified forms of adenine, cytidine, guanine, thymine and 
uridine that are not as easily recognised by endogenous endonucleases. 
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Activation of a polypeptide activity 

If a particular disease state is partially or completely due to a lowered level of biological activity from a 
polypeptide according to the invention, various methods may be used. An example of such a method 
includes administering a therapeutically effective amount of compound that can activate (i.e. an agonist) 
5 or cause increased expression of the polypeptide concerned. Administration of such a compound may be 
via any of the methods described previously. 

Gene Therapy 

Another aspect of the present invention provides for gene therapy methods involving nucleic acid 
molecules identified herein. Gene therapy may be used to affect the endogenous production of the 
10 polypeptide of the present invention by relevant cells in a patient. For example, gene therapy can be used 
permanently to treat the inappropriate production of a polypeptide by replacing a defective gene with the 
corrected therapeutic gene. 

Treatment may be effected either in vivo or ex vivo. Ex vivo gene therapy generally involves the isolation 
and purification of the patient's cells, introduction of the therapeutic gene into the cells and finally, the 

15 introduction of the genetically-altered cells back into the patient. In vivo gene therapy does not require the 
isolation and purification of patient cells prior to the introduction of the therapeutic gene into the patient. 
Instead, the therapeutic gene can be packaged for delivery into the host. Gene delivery vehicles for in vivo 
gene therapy include, but are not limited to, non-viral vehicles such as liposomes, replication-competent 
and replication-deficient viruses (for example, adenovirus as described by Berkner, K.L., in Curr. Top. 

20 Microbiol. Immunol., 158, 39-66 (1992)) or adeno-associated virus (AAV) vectors as described by 
Muzyczka, N., in Curr. Top. Microbiol. Immunol., 158, 97-129 (1992) and U.S. Patent No. 5,252,479. 
Alternatively, "naked DNA" may be directly injected into the bloodstream or muscle tissue as a form of 
in vivo gene therapy. 

One example of a strategy for gene therapy including a nucleic acid molecule of this present invention 
25 may be as follows. A nucleic acid molecule encoding a polypeptide of the invention is engineered for 
expression in a replication-defective or replication-competent vector, such as a retroviral vector. This 
expression construct may then be isolated and introduced into a packaging cell transduced with a 
retroviral plasmid vector containing RNA encoding the polypeptide, such that the packaging cell now 
produces infectious viral particles containing the' gene of interest. These producer cells may be 
30 administered to a patient for engineering cells in vivo and expression of the polypeptide in vivo (see 
Chapter 20, Gene Therapy and other Molecular Genetic-based Therapeutic Approaches, (and references 
cited therein) in Human Molecular Genetics (1996), T Strachan and A P Read, BIOS Scientific Publishers 
Ltd). 
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Genelic delivery of antibodies that bind to polypeptides according to the invention may also be effected, 
for example, as described in International patent application W 098/55607. 

Vaccines 

A further embodiment of the present invention provides that the polypeptides or nucleic acid molecules 
5 identified may be used in the development of vaccines. Where the aforementioned polypeptide or nucleic 
acid molecule is a disease-causing agent, vaccine development can involve the raising of antibodies 
against such agents. Where the aforementioned polypeptide or nucleic acid molecule is that is 
upregulated, vaccine development can involve the raising of antibodies or T cells against such agents (as 
described in W 000/29428). 

10 Vaccines according to the invention may either be prophylactic (i.e. prevents infection) or therapeutic (i.e. 
treats disease after infection). Such vaccines comprise immunising antigen(s), immunogen(s), 
polypeptide(s), protein(s) or nucleic acid, usually in combination with pharmaceutically-acceptable 
carriers as described above. Additionally, these carriers may function as immunostimulating agents 
("adjuvants"). Furthermore, the antigen or immunogen may be conjugated to a bacterial toxoid, such as a 

15 . toxoid from diphtheria, tetanus, cholera, H. pylori, and other pathogens. 

Vaccination .processes may involve the use of heterologous vectors eg: prime with MVA and boost with 
DNA. 

Since polypeptides may be broken down in the stomach, vaccines comprising polypeptides are preferably 
administered parenterally (for instance, subcutaneous, intramuscular, intravenous, or intradermal 
20 injection). Formulations suitable for parenteral administration include. aqueous and non-aqueous sterile 
injection solutions that may contain anti-oxidants, buffers, bacteriostats and solutes which render the 
formulation isotonic with the blood of the recipient, and aqueous and non-aqueous sterile suspensions 
which may include suspending agents or thickening agents. 

The vaccine formulations of the invention may be presented in unit-dose or multi-dose containers. For 
25 example, sealed ampoules and vials and may be stored in a freeze-dried condition requiring only the 
addition of the sterile liquid carrier immediately prior to use. The dosage will depend on the specific 
activity of the vaccine and can be readily determined by routine experimentation. 

The technology referred to as jet injection (see, for example, www.powderject.com) may also be useful in 
the formulation of vaccine compositions. 

30 In accordance with this aspect of the present invention, polypeptides can be delivered by viral or non-viral 
techniques. Non-viral delivery systems include but are not limited to DNA transfection methods. Here, 
transfection includes a process using a non-viral vector to deliver a antigen gene to a target mammalian 
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cell. Typical transfeclion methods include electroporation, nucleic acid biolistics, lipid-medialed 
transfeclion, compacted nucleic acid-mediated transfeclion, liposomes, immunoliposomes. lipofectin, 
cationic agent-mediated, cationic facial amphiphiles (CFAs) (Nature Biotechnology 1996 14; 556), 
multivalent cations such as spermine, cationic lipids or polylysine, 1, 2,-bis (oleoyloxy)-3- 
5 (trimethylammonio) propane (DOTAP)-choIesterol complexes (Wolff and Trubetskoy 1998 Nature 
Biotechnology 16: 421) and combinations thereof. 

Viral delivery systems include but are not limited to adenovirus vectors, adeno-associated viral (AAV) 
vectors, herpes viral vectors, influenza, retroviral vectors, lentiviral vectors or baculoviral vectors, 
Venezuelan equine encephalitis virus (VEE), poxviruses such as: canarypox virus (Taylor et al 1995 
10 Vaccine 13:539-549), entomopox virus (Li Y et al 1998 Xllth International Poxvirus Symposium p 1 44. 
Abstract), penguine pox (Standard et al. J Gen Virol. 1998 79:1637-46) alphavirus, and alphavirus based 
DNA vectors. 

In addition to the use of polypeptide-based vaccines, this aspect of the invention includes the use of 
genetically-based vaccines, for example, those vaccines that are effective through eliciting the expression 
15 of a particular gene (either endogenous or exogenously derived) in a cell, so targeting this cell for 
destruction by the immune system of the host organism. 

A number of suitable methods for vaccination and vaccine delivery systems are described in International 
patent application W 000/29428. 

D. Diagnostics 

20 Another aspect of the present invention provides for the use of a nucleic acid molecule identified herein 
as a diagnostic reagent. 

For example, a nucleic acid molecule may be detected or isolated from a patient's tissue and used for 
diagnostic purposes. "Tissue" as defined herein refers to blood, urine, any matter obtained from a tissue 
biopsy or any matter obtained from an autopsy. Genomic DNA from the tissue sample may be used 

25 directly for detection of a hypoxia-related condition. Alternatively, the DNA may be amplified using 
methods such as polymerase chain reaction (PCR), the ligase chain reaction (LCR), strand displacement 
amplification (SDA), or other amplification techniques (see Saiki et ai t Nature, 324, 163-166 (1986); 
Bej, et fl/., Crit. Rev. Biochem. Molec. Biol., 26, 301-334 (1991); Birkenmeyer et a/., J. Virol. Meth., 35, 
117-126 (1991) and Brunt, J., Bio/Technology, 8, 291-294 (1990)). Such diagnostics are particularly 

30 useful for prenatal and even neonatal testing. 

A method of diagnosis of disease using a polynucleotide may comprise assessing the level of expression 
of the natural gene and comparing the level of encoded polypeptide to a control level measured in a 
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normal subject that does not suffer from the disease or physiological condition that is being tested. The 
diagnosis may comprise the following steps: 

a) contacting a sample of tissue from the patient with a nucleic acid probe under stringent conditions 
that allow the formation of a hybrid complex between a nucleic acid molecule of the invention and 

5 the probe; 

b) contacting a control sample with said probe under the same conditions used in step a); and 

c) detecting the presence of hybrid complexes in said samples; 

wherein detection of differing levels of the hybrid complex in the patient sample compared to levels of 
the hybrid complex in the control sample is indicative of the dysfunction". 

10 A further aspect of the invention comprises a diagnostic method comprising the steps of: 

a) obtaining a tissue sample from a patient being tested for disease; 

b) isolating a nucleic acid molecule according to the invention from said tissue sample; and 

c) diagnosing the patient for disease by detecting the presence of a mutation in the nucleic acid 
molecule which is associated with disease. 

15 To aid the detection of nucleic acid molecules in the above-described methods, an amplification step, 
such as PCR, may be included. An example of this includes detection of deletions or insertions indicative 
of the dysfunction by a change in the size of the amplified product in comparison to the normal genotype. 
Point mutations can be identified by hybridising amplified DNA to labelled RNA of the invention or 
alternatively, labelled antisense DNA sequences of the invention. Perfectly matched sequences can be 

20 distinguished from mismatched duplexes by RNase digestion or by assessing differences in melting 
temperatures. The presence or absence of the mutation in the patient may be detected by contacting DNA 
with a nucleic acid probe that hybridises to the DNA under stringent conditions to form a hybrid double- 
stranded molecule, the hybrid double-stranded molecule having an unhybridised portion of the nucleic 
acid probe strand at any portion corresponding to a mutation associated with disease; and detecting the 

25 presence or absence of an unhybridised portion of the probe strand as an indication of the presence or 
absence of a disease-associated mutation in the corresponding portion of the DNA strand. 

Point mutations and other sequence differences between the reference gene and "mutant" genes can be 
identified by other well-known, techniques, such as direct DNA sequencing or single-strand 
conformational polymorphism, (see Orila el a/., Genomics, 5, 874-879 (1989)). For example, a 
30 sequencing primer may be used with double-stranded PCR product or a single-stranded template 
molecule generated by a modified PCR. The sequence determination is performed by conventional 
procedures with radiolabeled nucleotides or by automatic sequencing procedures with fluorescent-tags. 
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Cloned DNA segments may also be used as probes to detect specific DNA segments. The sensitivity of 
this method is greatly enhanced when combined with PCR. Further, point mutations and other sequence 
variations, such as polymorphisms, can be detected as described above, for example, through the use of 
aliele-specific oligonucleotides for PCR amplification of sequences that differ by single nucleotides. 

5 DNA sequence differences may also be detected by alterations in the electrophoretic mobility of DNA 
fragments in gels, with or without denaturing agents, or by direct DNA sequencing (for example, Myers 
et ai, Science (1985) 230:1242). Sequence changes at specific locations may also be revealed by nuclease 
protection assays, such as RNase and SI protection or the chemical cleavage method (see Cotton et ai, 
PNAS.USA (1985) 85:4397-4401). 

10 In addition to conventional gel electrophoresis and DNA sequencing, mutations such as microdeletions, 
aneuploidies, translocations, inversions, can also be detected by in situ analysis (see, for example, Keller 
et ai, DNA Probes, 2nd Ed., Stockton Press, New York, N.Y., USA (1993)), that is, DNA or RNA 
sequences in cells can be analysed for mutations without need for their isolation and/or immobilisation 
onto a membrane. FISH is presently the most commonly applied method and numerous reviews of FISH 

15 have appeared (see, for example, Trachuck et ai, Science, 250, 559-562 (1 990), and Trask et ai, Trends, 
Genet., 7, 149-154 (1991)). 

Arrays 

In another embodiment of the invention, an array of oligonucleotide probes comprising a nucleic acid 
molecule according to the invention can be constructed to conduct efficient screening of genetic variants, 
20 mutations and polymorphisms. Array technology methods are well known and have general applicability 
and can be used to address a variety of questions in molecular genetics including gene expression, genetic 
linkage, and genetic variability (see for example: M.Chee et d.,.Science (1996), Vol 274, pp 610-613). 

In one embodiment, the array is prepared and used according to the methods described in W095/1 1995 
(Chee et al)\ Lockhart, D. J. et ai (1996) Nat. Biotech. 14: 1675-1680); and Schena, M. et ai (1996) 

25 PNAS 93: 10614-10619). Oligonucleotide pairs may range from two to over one million. The oligomers 
are synthesized at designated areas on a substrate using a light-directed chemical process. The substrate 
may be paper, nylon or other type of membrane, filter, chip, glass slide or any other suitable solid support. 
In another aspect, an oligonucleotide may be synthesized on the surface of the substrate, by using a 
chemical coupling procedure and an ink jet application apparatus, as described in PCT application 

30 W095/25 1116 (Baldeschweiler et al). In another aspect, a "gridded" array analogous to a dot (or slot) blot 
may be used to arrange and link cDN A fragments or oligonucleotides to the surface of a substrate using a 
vacuum system, thermal, UV, mechanical or chemical bonding procedures. An array, such as those 
described above, may be produced by hand or by using available devices (slot blot or dot blot apparatus), 
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materials (any suitable solid support), and machines (including robotic instruments), and may contain 8, 
24, 96, 384, 1536 or 6144 oligonucleotides, or any other number between two and over one million which 
lends itself to the efficient use of commercially-available instrumentation. 

Diagnostics using polypeptides or mRNA 

5 In addition to the methods discussed above, diseases may be diagnosed by methods comprising 
determining, from a sample derived from a subject, an abnormally decreased or increased level of 
polypeptide or mRNA. Decreased or increased expression can be measured at the RNA level using any of 
the methods well known in the art for the quantitation of polynucleotides, such as, for example, nucleic 
acid amplification, for instance PCR, RT-PCR, RNase protection, Northern blotting and other 
10 hybridization methods. 

Assay techniques that can be used to determine levels of a polypeptide of the present invention in a 
sample derived from a host are well-known to those of skill in the art and are discussed in some detail 
above (including radioimmunoassays, competitive-binding assays, Western Blot analysis and ELISA 
assays). One example of this aspect of the invention provides a diagnostic method which comprises the 
15 steps of: (a) contacting a ligand as described above with a biological sample under conditions suitable for 
the formation of a ligand-polypeptide complex; and (b) detecting said complex. 

Protocols such as ELISA, RIA, and FACS for measuring polypeptide levels may additionally provide a 
basis for diagnosing altered or abnormal levels of polypeptide expression. Normal or standard values for 
polypeptide expression are established by combining body fluids or cell extracts taken from normal 
20 mammalian subjects, preferably humans, with antibody to the polypeptide under conditions suitable for 
complex formation The amount of standard complex formation may be quantified by various methods, 
such as by photometric means. 

Antibodies which specifically bind to a polypeptide of the invention may be used for the diagnosis of 
conditions or diseases characterised by expression of the polypeptide, or in assays to monitor patients 

25 being treated with the polypeptides, nucleic acid molecules, ligands and other compounds of the 
invention. Antibodies useful for diagnostic purposes may be prepared in the same manner as those 
described above for therapeutics. Diagnostic assays for the polypeptide include methods that utilise the 
antibody and a label to detect the polypeptide in human body fluids or extracts of cells or tissues. The 
antibodies may be used with or without modification, and may be labelled by joining them, either 

30 covalently or non-covalently, with a reporter molecule. A wide variety of reporter molecules known in the 
art may be used, several of which are described above. 

Quantities of polypeptide expressed in subject, control and disease samples from biopsied tissues are 
compared with the standard values. Deviation between standard and subject values establishes the 
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parameters for diagnosing disease. Diagnostic assays may be used to distinguish between absence, 
presence, and excess expression of polypeptide and to monitor regulation of polypeptide levels during 
therapeutic intervention. Such assays may also be used to evaluate the efficacy of a particular therapeutic 
treatment regimen in animaljtudies, in clinical trials or in monitoring the treatment of an individual 
5 patient. 

Diagnostic kits 

A diagnostic kit of the present invention may comprise: 

(a) a nucleic acid molecule of the present invention; 

(b) a polypeptide of the present invention; or 

10 (c) a ligand of the present invention. 

In one aspect- of the invention, a diagnostic kit may comprise a first container containing a nucleic acid 
probe that hybridises under stringent conditions with a nucleic acid molecule according to the invention; a 
second container containing primers useful for amplifying the nucleic acid molecule; and instructions for 
using the probe and primers for facilitating the diagnosis of disease. The kit may further comprise a third 
15 container holding an agent for digesting unhybridised RNA. 

In an alternative aspect of the invention, a diagnostic kit may comprise an array of nucleic acid molecules, 
an array of antibody molecules, and/or an array of polypeptide molecules, as discussed in more detail 
above. 

Such kits will be of use in diagnosing a disease or susceptibility to disease, particularly inflammation, 
20 oncology, or cardiovascular disease. 

Various aspects and embodiments of the present invention will now be described in more detail by way of 
example, with particular reference to polypeptides regulated differentially under hypoxic conditions as 
opposed to normoxic conditions. It will be appreciated that modification of detail may be made without 
departing from the scope of the invention. 

25 Brief description of the Figures 

Figure 1 shows a scatter plot, showing normalised signal intensities in hypoxia versus normoxia, with 
each dot representing a single gene. 

Figure 2: Hypoxia responses amplified by HIF1 alpha overexpression. Data shown is the average of 4 
repeat experiments. Values represent fold change as compared to untreated cells (condition 1). Error bars 
30 represent standard error of the mean. 
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Figure 3: Hypoxia responses amplified by EPAS1 overexpression. Data shown is the average of 4 repeat 
experiments. Values represent fold change as compared to untreated cells (condition 1). Error bars 
represent standard error of the mean. 

Figure 4: Hypoxia responses amplified by H IFI alpha / EPAS1 overexpression. Data shown is the average 
5 of 4 repeat experiments. Values represent fold change as compared to untreated cells (condition 1). Error 
bars represent standard error of the mean. 

Figure 5 shows genes that are induced by hypoxia to a greater degree in resting macrophages, as 
compared to activated macrophages. Error bars show the standard deviation from both repeat experiments 
and multiple exposures from single experiments. These data are not shown in table form. All bars are 
10 ratios of mRNA expression in hypoxia/ normoxia. These are calculated separately for resting (light bars) 
and activated (dark bars) macrophages, and do not illustrate differences resulting from activation in 
normoxia. 

Figure 6 shows genes which are induced by hypoxia to a greater degree in activated macrophages, 
compared to resting macrophages. 

15 Figure 7 shows genes that are repressed by hypoxia to a greater degree in activated macrophages. 

For Figures 8, 9a, 9c, I0-32a, 32d and 33-52, mRNA levels, determined from a custom gene array, of 
particular genes are shown on the Y-axis, expressed as a value as compared to the median expression 
level of this gene throughout all samples. Eleven primary human cell types as shown on the x-axis were 
cultured in normoxia (black), or exposed to hyopxia for 6hr (grey) or 1 8hr (white). 

20 Figure 8: Ecotropic viral integration site 2A (Seq ID:475/476). 

Figure 9a: Novel PI-3-kinase adapter (Seq ID:79/80); Image clone accession R62339. 

Figure 9b: TaqMan Real-time Q-RT-PCR data for Novel PI-3-kinase adapter (Seq ID:79/80); Image 
clone accession R62339. 

Figure 9c: IMAGE clone acc R59598 (Syk). 
25 Figure 10: Regulator of G-protein signalling 1 (Seq ID.375/376) 

Figure 1 1 : GM 2 ganglioside activator protein (Seq ID:389/390) 

Figure 12: Hypothetical protein PRO0823 (Seq 10:21/22) 

Figure 13: CYP1 (cytochrome P450, subfamily XXV1IB) (Seq ID:339/340) 

Figure 14: Alpha-2-macroglobulin (Seq lD:405/406) 
30 Figure 15: Interleukin 1 receptor antagonist (Seq 10:357/358) 
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Figure 16: SCYA3L (Seq lD:469/470) 

Figure 17: CFFM4 (Seq lD:433/434) 

Figure 18: Pleckstrin (Seq 10:431/432) 

Figure 19: CYPIBI (Seq ID :325/326) 
5 Figure 20: CYPIBI (SeqID: 1 37/1 38) 

Figure 21: Hypothetical protein FLII3511 (SeqID:! 63/1 64) 

Figure 22: Hematopoietic Zinc finger protein (SeqID : 1 7/1 8) 

Figure 23: Osteopontin (SeqID :267/268) 

Figure 24: Osteopontin (SeqID :267/268) 
10 Figure 25: Adipophilin (Seq ID :3 1 3/3 1 4) 

Figure 26: Adipophilin (SeqID;3 1 3/3 1 4) 

Figure 27: Adipophilin (SeqID:3 1 3/3 1 4) . 

Figure 28: Adipophilin (SeqID :3 1 3/3 1 4) 

Figure 29: Hypothetical protein FLJ22690 (SeqID :205/206) 
15 Figure 30: cDNA DKFZp586E1624 (SeqID: 65/66) 

Figure 31: EST (SeqID : 1 97/1 98) 

Figure 32a: EGL nine (C.elegans) homology (SeqID:85/86) 

Figure 32b: Gene expression profiles in macrophages with and without activation. mRNA levels, 
determined from a custom gene array, of clorfl2 are shown on the Y-axis, expressed as a value compared 
20 to the mean value of a set of control genes on each array (per-chip normalisation). All cells were human 
macrophages, cultured either without cytokines or with IL-10 or with the combination of IFN □ and LPS 
in normoxia and hypoxia. 

Figure 32c: Gene expression profiles in macrophages with and without activation. mRNA levels, 
determined from a custom gene array, of EGLN3 are shown on the Y-axis, expressed as a value compared 
25 to the mean value of a set of control genes on each array (per-chip normalisation). All cells were human 
macrophages, cultured either without cytokines or with IL-10 or with the combination of IFNO and LPS 
in normoxia and hypoxia. 

Figure 32d: Clorfl2 (SeqID: 89.90) 
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Figure 32e: The effect of EPAS/ HIF overexpression on expression of the gene Ciorfl2 EGLN genes 
using a custom gene array. mRNA expression levels of the gene clORFI2 as determined by the custom 
array, in response to hypoxia and adenoviral over-expression of HIF or EPAS are shown. Experimental 
conditions are as follows: #1 no adeno / normoxia; #2 empty adeno (low dose)/ normoxia; S3 empty 
5 adeno (high dose)/ normoxia; U empty adeno (low dose)/ hypoxia; #5 empty adeno (high dose)/ hypoxia; 
#6 HIF-1 adeno (low dose)/ hypoxia; #7 HIF-1 adeno (high dose)/ hypoxia; #8 EPAS adeno (low dose)/ 
hypoxia; #9 EPAS adeno (high dose)/ hypoxia. Error bars are the standard error of the mean. 

Figure 32f: The effect of EPAS/ HIF overexpression on expression of the gene EGLN3 gene using a 
custom gene array. mRNA expression levels of the gene EGLN3 as determined by the custom array, in 
10 response to hypoxia and adenoviral over-expression of HIF or EPAS are shown. Experimental conditions 
are as follows: #1 no adeno / normoxia; #2 empty adeno (low dose)/ normoxia; #3 empty adeno (high 
dose)/ normoxia; #4 empty adeno (low dose)/ hypoxia; #5 empty adeno (high dose)/ hypoxia; #6 HIF-1 
adeno (low dose)/ hypoxia; #7 HIF-1 adeno (high dose)/ hypoxia; #8 EPAS adeno (low dose)/ hypoxia; 
#9 EPAS adeno (high dose)/ hypoxia. Error bars are the standard error of the mean. 

15 Figure 32g: The effect of EPAS/ HIF overexpression on expression of the EGLN3 gene using AffyMetrix 
Hu95 ver2 GeneChips. mRNA expression levels of the gene in response to hypoxia and adenoviral over- 
expression of HIF or EPAS are shown. Graphs show the mean of two replicate arrays, with error bars as 
standard deviation. Above each graph, data values are shown, including the normalised values and raw 
values (the AffyMetrix average difference parameter) and Present/ Absent flags. 

20 Figure 32h: The effect of EPAS/ HIF overexpression on expression of the clorf!2 gene using AffyMetrix 
Hu95 ver2 GeneChips. mRNA expression levels of the gene in response to hypoxia and adenoviral over- 
expression of HIF or EPAS are shown. Graphs show the mean of two replicate arrays, with error bars as 
standard deviation. Above each graph, data values are shown, including the normalised values and raw 
values (the AffyMetrix average difference parameter) and Present/ Absent flags. 

25 Figure 32i: Flag immunocytochemistry in HEK293T cells 

Figure 32j: Human Cardiomyocyte Caspase Activity after 72 hours transduction with EIA V-ELG9- 
Homolog3 

Figure 33: Novel Metallothionein (SeqID:83/84) 
Figure 34: Hypothetical protein hqp0376 (SeqID:337/338) 
30 Figure 35: Metallothionein 2A (SeqID;265/266) 
Figure 36: Metallothionein 1G (SeqlD:243/244) 
Figure 37: Metallothionein IH (SeqID: 239/240) 
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Figure 38: Hepcidin antimicrobial peptide (Seq ID : 1 4 1 /l 42) 

Figure 39: EST (SeqlD: 117/118) 

Figure 40: Hypothetical protein FLJ22622 (SeqlD :] 29/1 30) 

Figure 41: TRIP-Br2 (SeqlD :3 1 /32) 
5 Figure 42: Tumor protein D52 (SeqID:301/302) 

Figure 43: Semaphorin 4b (SeqlD:91/92/92a) 

Figure 44: Dec-1 (SeqlD :37 1 /372) 

Figure 45: Calgranulin A (SeqlD:447/448) 

Figure 46: EROl (S. cerevisiae)-like (SeqID:67/68) 
10 Figure 47: Hypothetical protein FLJ20500 (SeqID:25/26) 

Figure 48: N-myc downstream regulated (SeqID:229/230) 

Figure 49: Decidual protein induced by progesterone (SeqlD :387/388) 

Figure 50: Integrin, alpha 5 (SeqlD:379/380) 

Figure 51 : Tissue factor (SeqID:225/226) 

15 Figure 52: COX-2 (SeqlD:237/238) 

Figure 53: Genes up-regulated by macrophage activation. Normalised mRNA levels in the 6 experimental 
conditions (#1 no cytokines/ normoxia, #2 no cytokines/ hypoxia, #3 1L-10/ normoxia,#4 IL-I0/ hypoxia, 
#5 LPS/IFN/ normoxia, #6 LPS/1FN/ hypoxia) are shown as values referenced to the median value of that 
gene throughout all 6 experimental conditions. Error bars show the standard error of the mean. 

20 Figure 54: Genes downregulated by macrophage activation (I) 

Figure 55: Genes downregulated by macrophage activation (II) 

Figure 56: Genes downregulated by macrophage activation (III) 

Figure 57 shows an RNase protection assay for the gene encoding Semaphorin 4b. 

Figure 58 shows a Northern blot showing the size of the mRNA and tissue distribution for the 
25 Semaphorin 4b gene. 
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Examples 
Summary 

Subtracted cDNA libraries were separately prepared for hypoxic macrophages and cardiom yoblasts. This 
involved harvesting RNA from cells both in normoxia and hypoxia, and preparing cDNA. Subtractive 
5 hybridization / suppression PCR was then performed to remove genes from the hypoxic cell cDNA, 
which are also present in cDNA from normoxic cells. Insert DNA from the libraries was PCR amplified 
and arrayed onto duplicate membranes. Quantitative hybridizations with pre-Iibrary cDNA material 
(normoxia and hypoxia) were done to identify clones in the libraries that actually contain hypoxia 
inducible genes. The insert DNA was then sequenced. 

10 This procedure was done independently for macrophage and cardiomyoblast. The hypoxia inducible 
genes identified from these different cell types differed widely, with only a minority of these.genes being 
identified from both cell types. 

To characterise the differences between the two tissues further, arrays were produced containing all 
confirmed hypoxia-inducible genes from the macrophage library. Replicate arrays were hybridised with 
15 cDNA from normoxic and hypoxic cardiomyoblasts to allow quantitative evaluation of these genes in the 
cardiomyoblast. This revealed quantitative differences in the hypoxia induced activation these genes in 
the two cell types. 

Example la: Comparison of the hypoxic-response between human macrophages and 
cardiomyoblasts by a subtraction cloning / array screening approach 

20 Methods /Results 

To isolate human macrophage, monocytes were derived from peripheral blood of healthy human donors. 
IOOml bags of buffy coal from the Bristol Blood Transfusion Centre were mixed with an equal volume of 
RPMII640 medium (Sigma). This was layered on top of 10m] ficol-paque (Pharmacia) in 50ml centrifuge 
tubes and centrifuged for 25 min at 800 x g. The interphase layer was removed, washed in MACS buffer 

25 (phosphate buffered saline pH 7.2, 0.5% bovine serum albumin, 2mM EDTA) and resuspended at 80 
microliter per 10n7 cells. To this 20 microliter CD 14 Microbeads (Miltenyi Biotec) were added, and the 
tube incubated at 4 degrees for 15 min. Following this one wash was performed in MACS buffer at 400 x 
g and the cells were resuspended in 3 ml MACS buffer and separated on an LS+ MACS Separation 
Column (Miltenyi Biotec) positioned on a midi-MACS magnet (Miltenyi Biotec). The column was 

30 washed with 3 x 3ml MACS buffer. The column was removed from the magnet and cells were eluted in 5 
ml MACS buffer using a syringe. Cells were washed in culture medium (AIM V (Sigma) supplemented 
with 2% human AB serum (Sigma), and resuspended at 2 x 10n5 cells per ml in the same medium and 



WO 02/46465 



PCT/G B0 1/05458 



68 

placed in Jarge tenon-coated culture bags (Sud-Laborbedarf GmbH, 82131 Gauting, Germany) and 
transferred to a tissue culture incubator (37 degrees, 5% C02) for 7-10 days. During this period 
monocytes spontaneously differentiate to macrophages. This is confirmed by examining cell morphology 
using phase contrast microscopy. Cells are removed from the bags by placing at 4 degrees for 30 min and 
5 emptying (he contents. The cells are then washed and resuspended in culture medium at 5 x 105 cell/ml 
and plated out in Primeria 10 cm tissue culture petri dishes (Falcon Becton Dickinson) at 5 x 10n6 cells 
per dish. Culture is continued for 16-24hr to allow cell adherence, prior to experimentation involving 
hypoxia. 

As an alternative primary cell type human cardiomyoblast cultures were established. Cells derived from 
10 the ventricular tissue of newborn or foetal hearts were purchased from BioWhittaker (CC-2582). Growth 
conditions were used to allow maximum expansion of the cells in vitro, by using a medium rich in growth 
factors. Under such conditions cardiomyoblast-like cells predominate (the developmental precursor of 
cardiomyocytes). This has been previously described by Goldman and Wurzel {In Vitro Cell Dev. Biol 
28A: 109-1 19 (1992)) and Goldman et ai, (1996, Exp.Cell.Res. 228(2): 237-245). 

15 For these cultures, cells were seeded at IxlO 6 per T150 flask in human smooth muscle growth medium 
(TCS CellWorks ZHM-3935) and were expanded in the same medium up to a maximum number of 4 
passages. The growth medium is purchased pre-prepared, and includes in the formula, 5% fetal bovine 
serum, insulin, epidermal growth factor and fibroblast growth factor. Prior to experimentation involving 
hypoxia, cells were plated onto 10 cm tissue culture petri dishes and allowed to reach confluency. 

20 For experimentation with hypoxia, for all cell types, an equal number of identical culture dishes were 
divided into two separate incubators: One at.37 degrees, 5% C02, 95% air (=Normoxia) and the other at 
37 degrees, 5% C0 2 , 94.9% Nitrogen, 0.1% Oxygen (=Hypoxia). After 6 hours culture under these 
conditions, the dishes were removed from the incubator, placed on a chilled platform, washed in cold PBS 
and total RNA was extracted using RNazol B (Tel-Test, Inc; distributed by Biogenesis Ltd) following the 

25 manufacturer's instructions. Polyadenylated mRNA was extracted from the total RNA using a 
commercial kit following the manufacturer's instructions (Promega; PoIyATract mRNA isolation System 
IV). 

The hypoxia period of 6 hr was previously determined to be sufficient to allow the induction of known 
hypoxia-regulated genes, as determined by RNase protection assays. During these preliminary studies it 
30 was noted that macrophages, cardiomyoblasts and an additional control cell type, Jurkat T-cells, showed 
different patterns of gene induction in response to hypoxia: 

Known Hypoxia-inducible gene level of hypoxia-induced increase in mRNA levels 

Macrophage Myoblast T-cell 
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phosphoglycerate kinase-J 
(PGK) . 

vascular endothelial growth factor-A 
(VEGF) 

5 solute carrier family 2, member 1 
(GIut-1) 



high 



high 



none 



low 



low 



none 



high 



high 



high 



Separate subtracted cDNA populations were generated from mRNA extracted from hypoxic macrophages 
and hypoxic cardiomyoblasts, using a combination of two kits, purchased from Clontech Laboratories- 
SMART PCR cDNA synthesis kit and PCR Select cDNA subtraction kit. The manufacturer's instructions 
10 were followed for both kits. All diagnostic steps were followed as recommended by the manufacturers. 
All PCR reactions were done using an Applied Biosystems 9700 with 96-well block, using Applied 
Biosystems plastics. Driver and tester populations used for subtraction were as below: 

subtracted cDNA tester driver 

Subtracted macrophage macrophage (hypoxia) macrophage (normoxia) 

Subtracted cardiomyoblast cardiomyoblast (hypoxia) cardiomyoblast (normoxia) 

15 The final subtracted cDNA samples were evaluated by performing RT-PCR using the following primers 
for human beta actin: 

sense: TCACCCACACTGTGCCCATCTACG A 

antisense: CAGCGGAACCGCTCATTGCCAAATGG 

This showed that an additional 5 cycles of PCR were required to achieve similar levels of beta actin 
20 product from subtracted compared to unsubtracted cDNA, indicating a significant reduction in the 
representation of a non-regulated gene in the subtracted cDNA. Glyceraldehyde 3-Phosphate 
dehydrogenase PCR primers, as contained in the kit, were not used. 

The three subtracted cDNA populations were ligated into a plasmid vector (pCRH, Invitrogen) to 
generate libraries, which were transformed into E.coli (INVaF, Invitrogen) and plated out onto agar, 
25 supplemented with ampicillin and X-Gal, according to standard methods. 

Colonies that are white indicate the presence of a recombinant plasmid, and these were picked into 
individual wells of 96-well plates containing 100 microliters LB-Ampicillin, and given 3-8 hr growth at 
37 degrees. In this way, for each library, up to 15 x 96-well plates of clones were generated. 
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To screen clones for the presence of differentially expressed genes, replicate arrays of plasmid insert 
DNA were generated on nylon membranes: Firstly, PCR was performed using nested PCR primers 2R 
and 1, which flank the cDNA insert of each clone (sequence described in the PCR Select kit). The 
reaction mix also contains 200 uM d(A/I\C,G)TP, Advantage2 polymerase mix (Clontech Laboratories) 

5 and supplied lOx buffer. 40 ul reactions were set up in 96-well PCR reaction plates and inoculated with 
0.5 ul bacteria from the library plates. 23 cycles of PCR were performed (95 degrees 10 sec; 68 degrees 2 
min), and a selection of wells were checked on an agarose gel. In this manner a 96-well plate of insert 
DNA was generated for each 96-well plate of bacterial clones. Arrays of insert DNA were generated by 
transferring 4ul of each well to 384-well plates (Genetix), and denaturing the DNA by adding 4ul 0.4M 

10 NaOH and incubating at 37 degrees for 15 minutes. Bromophenol blue was added to the wells to allow 
visualisation of arraying. A 384-pin replicator (Genetix) was used to spot small volumes of denatured 
insert DNA onto dry nylon membranes (Hybond N+, AmershamPharmacia). 

By repeating this operation from the same 384-well plate onto several membranes, matched pairs of 
membranes were produced, suitable for array screening. A fragment of the beta actin gene was spotted at 
15 specific positions of the arrays. Following spotting, the membranes were left at room temperature for 2 hr, 
re-denatured by placing on chromatography paper wetted with 0.3 M NaOH, neutralised by placing on 
chromatography paper wetted with 0.5 M Tris pH 7.5, dried at room temperature for 2 hr and crosslinked 
by exposing to 2000 joules UV radiation. Prior to hybridisation, residual salts were removed from the 
arrays, by washing in hot 0.5% SDS. 

20 Matched pairs of membranes were hybridised with subtracted cDNA samples; from hypoxic and 
normoxic cells, to determine the abundance of the genes corresponding to each spotted clone in the cDNA 
samples. Because the cDNA probes were subtracted, large differences in the hybridisation signal for 
individual spots were apparent, which can be identified by eye. Prior to probe labelling, subtracted cDNA 
samples were digested with Rsal and run through Qiagen Qiaquick PCR purification columns to remove 

25 adapter sequences added during the PCR Select procedure. 25 ng cDNA was labelled with 33P using a 
commercial kit following the manufacturer's instructions (Promega, Prime-a-gene kit), and 
unincorporated label was removed using BioRad Biospin-6 columns following adding 2.5ug yeast tRNA 
carrier. 

Pre-hybridisation, hybridisation and washes were performed essentially according to the Research 
30 Genetics GeneFilters protocol, but supplementing the hybridisation mixture with 10 ug of a cocktail of 
oligonucleotides complementary to the Clontech PCR Select nested PCR primers (equimolar mix of 
primers 1 and 2R and their reverse complements). 
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Hybridized arrays were exposed to X-ray film or were exposed to a phosphorimager (Molecular 
Dynamics, Storm) and clones showing gross differences in the hybridization signals with hypoxic 
compared to normoxic cDNA probes were identified. This procedure was used to process all clones 
originally picked from the primary libraries and PCR amplified. The selected clones were grouped 
5 together onto a single array (referred to here as a secondary array), and were re-scrtcned with cDNA 
probes which had not been subtracted, to allow a more quantitative though less sensitive, evaluation of 
the relative abundance of the genes in hypoxia vs. normoxia. 

In this case, probes were ds cDNA generated from the Clontech SMART cDNA synthesis kit (labelled 
using the Promega Prime-a-gene kit) or were total RNA (labelled according to the Research Genetics 
10 GeneFilters protocol), and hybridisations were done according to the Research Genetics GeneFilters 
protocol. 

Hybridization signals were measured using a phosphorimager and were processed with Array Vision 
(Imaging Research Inc) software using multiple bela-actin spots to normalise the quantitation and 
individual spot background correction. At this stage, the inserts of clones showing consistent up- 
15 regulation in hypoxia were sequenced using the 2R primer. 

The identity of the genes were determined using BLAST at the NCBI (NLM, NIH) against the non- 
redundant data base collection. Where significant matches to human genes were not made, the human 
EST database was used. For both EST and non-EST hits, identifier numbers were also obtained from the 
UniGene database. 

20 The above strategy was used independently for libraries derived from macrophages and from 
cardiomyoblasts. By screening a relatively large number of clones (several thousand per library), single 
genes were identified from multiple clones from any individual library. Multiple clones covered either the 
same or different regions of the genes. 

In the above manner, certain hypoxia-inducible genes were identified from clones only derived from the 
25 cardiomyoblast library. These genes are listed in Table 1. Certain hypoxia-inducible genes were identified 
from clones only derived from the macrophage libraries. These genes are listed in Table 2. Certain 
hypoxia-inducible genes were identified from clones derived from both macrophage and myoblast 
libraries. These genes are listed in Table 3. 

It can be seen that Table 3 contains many less genes than either Tables I and 2; demonstrating that these 
30 cell types have large differences in the genes induced by hypoxia. Importantly, the subtracted libraries for 
macrophage and cardiomyoblast were constructed in parallel. Therefore, major differences in the 
spectrum of genes isolated from these libraries are likely to be due to differences in the starting material, 
rather than due to technical differences in the production of the libraries. Importantly, the genes contained 
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in these tables were confirmed to be hypoxia-regulated in the relevant cell type(s) by the described two- 
stage array hybridisation screening process. 

From Table 3 it is clear that although this subset of genes was found in subtracted libraries from both 
hypoxic macrophages and cardiomyoblasts, the fold-induction obtained between hypoxia and normoxia, 
5 for the different tissues differs widely. For (he first 5 genes in this table, the hypoxia response is greater 
for macrophages, whereas for the last 2 genes it is greater for cardiomyoblasts. 

To test whether genes isolated only in the macrophage-derived subtracted libraries are not responsive to 
hypoxia in cardiomyoblast, cardiomyoblast cDNA isolated from normoxic and hypoxic cells was 
hybridised to an array of macrophage-derived clones. These data are presented as a scatter plot, showing 

10 normalised signal intensities in hypoxia versus normoxia, with each dot representing a single gene on the 
array. This plot is presented in Figure 1. A gene that is not affected by hypoxia will localise around the 
y=x line, running diagonally through the centre of the graph. From the figure, it can be seen that most 
genes lie in this region, even though all the genes were responsive to hypoxia in the macrophage (Table 
2). There is a subset of genes that lie beneath this region (x>y), representing induction of these genes by 

15 hypoxia in the cardiomyoblast. 

Sequence data for the cDNA inserts of clones from our custom subtracted cDNA libraries is available. 
These are usually short fragments of 300-1000 bp. Some have been resequenced to obtain an accurate full 
insert sequence (see document "gene sequences/analysis"). 

Several of the genes presented in Tables 1 -3 encode hypothetical proteins of unknown function and others 
20 have no database matches with protein coding sequence. The work presented here provides some 
functional annotation for these genes, and potential applications for the treatment of disease. Certain 
genes, in particular the glycolytic enzymes and transporters, have been hypothesised previously as 
forming part of the generic hypoxia response. The data provided herein provide solid, validating data for 
these hypotheses. 

25 It was surprising to note that cells from our cultures of human ventricle-derived cells, showing a 
cardiomyobiast-iike phenotype, do not support significant induction of the following genes: Lactate 
dehydrogenase A„ Enolase I, Phosphoglycerate kinase 1, Triphosphate isomerase 1 . These genes have 
been identified as being targets of the "ubiquitous" transcription factor HIF-1 alpha ("HIF-1 : mediator of 
physiological and pathophsiological responses to hypoxia" J.Appl.Physiolii: 1474-1480 (2000)). 

30 Example lb: Preparation of custom array 

To confirm the findings presented in Example la, and to obtain more accurate and additional data, both 
the subtracted cDNA'iibrary clones and the IMAGE clones identified from the Research Genetics Human 
GeneFilters have now been fabricated by the authors into an independently produced and verified gene 
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array (referred to herein as the "custom gene array"), composed of PCR-amplified insert DNA. The 
methods used to produce this array are common in the art, but the key points are summarised below. 

Clones from the subtracted cDNA library were PCR amplified as defined in Example la. In many cases, 
there were multiple cDNA clones corresponding to different regions of the same gene, and all these were 
5 represented on the custom gene array. IMAGE clones were obtained from the UK MRC HGMP Resource 
Centre (Hinxton, Cambridge CB10 1SB, UK) and were re-isolated as individual colonies and sequenced 
to verify the.correct identity of the clone. In the majority of cases, the same IMAGE clone identified from 
the Research Genetics Human GeneFilters was selected, but in some instances these clones were not 
available and alternatives were selected, corresponding to the same gene. 

10 Additional genes, with well-defined roles in various disease processes relevant to hypoxia, were also 
represented on the array, as derived from IMAGE clones. It is well established in the literature that genes 
with similar functions are often co-regulated at the mRNA level, as determined by microarray data 
clustering methods (Iyer VR et al Science. 1999 283(5398):83-7; Eisen MB el al Proc Natl Acad Sci U S 
A. 1998 95(25):14863-8). This allows associations to be made between genes of unknown function (as 

15 present in the current specification) to genes of well defined function, in order to add significance to the 
former. 

Normalisation is a key issue in array analysis. The custom gene array is a single colour type array, and 
contains a selection of additional IMAGE clones corresponding to genes which were empirically 
determined not to be affected by hypoxia and which are highly expressed in a wide range of human 
20 tissues and cell types. During data analysis, spot intensities were divided by the mean of all the reference 
genes shown below, each of which was present in quadruplicate on each array. 



Gene IMAGE clone Acc. 

FLU 1 102 fis clone PLACE10O5646 AA464704 

25 matrix Gla protein A A 155913 

guanine nucleotide binding protein alpha stimulating 1 R4358I ' 

DKFZp434A1319 W74725 

cDNA FLJ23280 fis clone HEP07194 AA669443 

beta actin (in house clone) 

30 EFla-like protein AI817566 

ribosomal protein L37a W91881 



IMAGE clone plasmid miniprep DNA was prepared and PCR amplified with flanking vector primers of 
the sequences GTTTTCCCAGTCACGACGTTG and TGAGCGGATAACAATTTCACACAG. This was 
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then purified and concentrated by ethanol precipitation, and the presence of a single band and DNA 
concentration were determined by agarose gel electrophoresis and by digital imaging methods. 

Purified PCR product corresponding to all the clones (IMAGE and non-IM AGE) were normalised to 0.5 
mgl ml by dilution. Arrays were fabricated onto Hybond N+ (Amersham) membranes using a 
5 BioRobotics TAS arrayer (Biorobotics, Cambridge CB37LW, UK) with a 500 micron pin tool. Using 
384-well source plates and a 2x2 arraying format this array was relatively low density, thereby 
eliminating problems of spot-to-spot signal bleed. Also the large pin size and high source plate DNA 
concentration improves the sensitivity of detection. Post-arraying denaturation/ neutralisation was 
essentially as described by Bertucci F et ai, 1999 (Oncogene 18: 3905-3912). 

10 Total RNA was extracted from cells using RNeasy (Qiagen) and 7 micrograms RNA was labelled with 
100 microCi 33P dCTP using 2 micrograms poly dT (10-20 mer) as primer in a reverse transcription 
reaction. First strand RNA was then degraded under alkaline contitions, and this was then neutralised with 
Tris HC1 pH 8.0, and the labelled cDNA was purified using BioRad BioSpin-6 chromatography columns. 
Pre-hybridisation was performed in 4 ml Research Genetics MicroHyb solution supplemented with 

15 lOmicrograms poly dA (10-20 mer) and 10 micrograms Cot-1 DNA, at 45 degrees for 2-3 hours. The 
cDNA was then denatured by heating and added to the pre-hybridisation, which was continued for 18- 
20hr. Washing steps were done as follows: 2xSSC/ 1% SDS 2x20min at 50 degrees and 0.5xSSC/ 1% 
SDS lOmin at 55 degrees. Arrays were exposed to Amersham Low Energy phosphor screens for 24hr and 
scanned using a phosphorimager at 50 micron resolution. Image analysis was done using ArrayVision 

20 software (Imaging Research Inc). Tab delimited data files were exported and a full analysis performed 
using GeneSpring software (Silicon Genetics). 

Using the described methodology a dynamic range of detection of 4 logs and a sensitivity of at least 1 / 
50,000 is obtained, as determined by spike doping titration experiments. Having several technical 
differences compared to the Research Genetics Human GeneFilters as used in the initial filing, data from 
25 the custom gene array is expected to be quantitatively different. 

Example 1c: Hypoxia regulation of gene expression in macrophages by exposing cells to hypoxia +/- 
additional signal amplification. 

The transcription factor HIF-la, is ubiquitously present in cells and is responsible for the induction of a 
number of genes in response to hypoxia. This protein is considered a master regulator of oxygen 
30 homeostasis (see, for example, Semenza, (1998) Curr. Op. Genetics and Dev. 8:588-594). Although HIF- 
la is well known to mediate responses to hypoxia, other transcription factors are also known or suspected 
to be involved. These include a protein called endothelial PAS domain protein I (EPAS1) or HIF-2a, 
which shares 48% sequence identity with HIF-la (Tian H, et al. Genes Dev. 1997 11:72-82.). Evidence 
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suggests that EPAS 1 is especially important in mediating the hypoxia-response in certain cell types, and it 
is clearly detectable in human macrophages, suggesting a role in this cell type (Griffiths et al., 2000, Gene 
Ther.,7(3):255-62). 

As supporting evidence for the hypoxic regulation of the genes contained within this specification, 
5 adenoviral vectors were used to overexpress HIF-la and EPAS1 in primary human macrophages prior to 
exposure to hypoxia, in order to amplify the response. Because the role of these transcription factors as 
mediators of the hypoxia response is very well established, any further increases in the inducibility of 
specific genes resulting from this approach represents credible supporting evidence that those genes are 
responsive to hypoxia. 

10 A commercially available system was used herein to produce adenoviral particles involving the 
adenoviral transfer vector AdApt, the adenoviral genome plasmid AdEasy and the packaging cell line 
Per-c6 (Crucell, Leiden, The Netherlands). The standard manufacturer's instructions were followed. 
Three derivatives of the AdApt transfer vector have been prepared, named AdApt ires-GFP, AdApt HIF- 
la-ires-GFP and AdApt EPASl-ires-GFP. In these vectors, for convenience, AdApt was modified such 

15 that inserted genes (i.e. HIF-la or EPAS1) expressed from the powerful cytomegalovirus (CMV) 
promoter were linked to the green fluorescent protein (gfp) marker, by virtue of an internal ribosome 
entry site (ires). Therefore presence of green fluorescence provides a convenient indicator of viral 
expression of HIF-la or EPAS1 in transduced mammalian cells. The control vector AdApt ires-GFP was 
used to allow discrimination between effects of the inserted genes (i.e. HIF-la or EPAS1) to that of 

20 potential non-specific effects of adenoviral transduction or GFP expression. Standard subcloning methods 
were used to construct the adenoviral constructs as described in detail elsewhere (see co-pending, co- 
owned International patent application PCT/GBOI/00758; Example 2). 

The adenoviral transfer vectors AdApt HIF-la-ires-GFP and AdApt EPASl-ires-GFP, were verified prior 
to production of adenoviral particles, for their ability to drive expression of functionally active HIF-la or 
25 EPAS1 protein from the CMV promoter in mammalian cells. This was achieved by transient transfection 
luciferase-reporter assays as described (Boast K etal Hum GeneTher. 1999 Sep l;I0:2197-208). 

Using the aforementioned Introgene adenoviral system, caesium-banded, pure adenoviral particles were 
produced for each of the vectors AdApt ires-GFP, AdApt HIF-la-ires-GFP and AdApt EPASl-ires-GFP. 
Following the Introgene manual, adenoviral preparations were quantitated by spectrophotometry, yielding 
30 values of viral particles (VP) per milliliter. 

Primary human macrophages isolated as described above, were washed and resuspended in DM EM 
(Gibco, Paisley, UK) supplemented with 4% fetal bovine serum (Sigma). 5x1 0 6 ceils were plated into 
nine individual I Ocm Primeria (Falcon) tissue culture dishes containing nvedium plus adenovirus as 
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shown below (experimental design), to give a total volume of 10 ml per plate.- Two doses of adenovirus 
were used; 5.3x10 s viral particles / ml (low) and 1 .6x1 0 9 viral particles / ml (high). These amounts were 
chosen following a series of titration experiments. Following culture for 16 hr, during which the 
macrophages adhere to the plate and are infected by the adenoviral particles, the medium was removed 
5 and replaced by IMDM medium (Gibco, Paisley, UK) supplemented with 2% human AB serum. A 
further 24 hr period of culture was allowed prior to experimentation, to allow gene expression from the 
transduced adenovirus. Gene transduction was verified by visually assessing gfp expression and 
expression of the viral HlF-la and EPAS1 genes was determined by real time quantitative RT-PCR using 
an ABI Prism 7700 TaqMan and CyberGreen protocol. For the high doses of virus, the total levels of 
10 HIF-la or EPAS1 mRNA present in the transduced cells were increased by 10-30 fold. 

For experimentation with conditions of hypoxia, identical culture dishes were divided into two separate 
incubators: One at 37 degrees, 5% C02, 95% air (=Normoxia; equivalent to 20% Oxygen) and the other 
at 37 degrees, 5% C02, 94.9% Nitrogen, 0.1% Oxygen (=Hypoxia). After 6 hours culture under these 
conditions, the dishes were removed from the incubator, placed on a chilled platform, washed in cold PBS 
15 and total RNA was extracted using RNeasy (Qiagen) following the manufacturer's instructions. 
Experimental design 

Condition Adenovirus Adenovirus Oxygen 

(type) amount (%) 

(low=5.3x!0 8 vp/ml 

20 high=1.6xl0 9 vp/ml) 

1 none none 20 

2 AdAptires-GFP ' low 20 

3 AdAptires-GFP high 20 

4 AdAptires-GFP low 0.1 
25 5 AdAptires-GFP high 0.1 

6 AdAptHIF-la-ires-GFP low 0.1 

7 AdAptHIF-la-ires-GFP high 0.1 

8 AdApt EPASl-ires-GFP low 0.1 

9 AdApt EPASl-ires-GFP high 0.1 

30 

RNA samples from the experimental conditions shown above were each hybridised to individual copies 
of the Custom gene array and processed as described earlier. To ensure reproducible data, this was 
repeated so each RNA sample was hybridised to 4 separate arrays. Therefore a total of 36 arrays were 
used for this experiment. Data analysis was done taking the mean signal of each spot from the four array 
35 replicates of each RNA sample. When displayed graphically, standard error of the mean is displayed as 
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(he error bar. Expression values were calculated so that they represent the fold-change ratio as compared 
to condition^ , i.e. untreated cells. 

For genes shown in Table 4 it can be seen that in cells transduced by the control adenovirus AdApt ires- 
GFP there is a response to hypoxia (conditions 4,5) as compared to in normoxia (conditions 2,3). 
5 However this response is significantly greater when the natural hypoxia response is amplified by 
overexpression of HIF-lalpha from the adenovirus AdApt HlF-la-ires-GFP (conditions 6,7). 
Furthermore, this effect is usually dependent on the amount of HIF1 alpha overexpression (i.e. greater in 
condition 7 compared to 6). This same data is displayed graphically in Figure 2. It can be seen that these 
genes encode metallothionein proteins. One of these (Nucleotide Seq ID No. 84; Protein Seq ID No. 83) 
10 is a novel member of the matallothionein family. Several metallothionein genes are known in the art to be 
activated by hypoxia, supporting the usefulness of this data. 

For genes shown in Table 5 and Figure 3 it can be seen that in cells transduced by the control adenovirus 
AdApt ires-GFP there is a response to hypoxia (conditions 4,5) as compared to in normoxia (conditions 
2,3). However this response is significantly greater when the natural hypoxia response is amplified by 
15 overexpression of EPAS1 from the adenovirus AdApt EPAS1 -ires-GFP (conditions 8,9). 

In the case of the protein encoded by Seq ID No. 24, results are available independently for two separate 
cDNA clones representing non-overlapping regions of the same full length gene. 

In the case of the protein encoded by Seq ID No. 86 (EGL nine (C.elegans) homolog 3), additional 
evidence is described above in support of the function of this protein. Furthermore, real time quantitative 

20 RT-PCR analysis of this gene using an AB1 Prism 7700 TaqMan and CyberGreen protocol, has been 
performed, to verify and more accurately quantitate the upregulation of EGL nine (C.elegans) homolog 3 
in response lo hypoxia and EPAS1 adenoviral overexpression. The main difference between the array- 
based and real time quantitative RT-PCR methodologies is that the latter is far more sensitive and 
therefore can detect expression in the off-state (here normoxia) for weakly expressed genes. This data has 

25 shown an induction ratio of 819-fold for EGL nine (C.elegans) homolog 3 in response to hypoxia with 
additional EPAS1 expression, from RNA generated from an independent experiment. This data was 
normalised to beta actin. 



Similarly another weakly-expressed EPASl-induced gene,' Semaphorin 4b (Seq ID No. 91/92; see 
additional discussion above) has been determined using real time quantitative RT-PCR methodology, 
30 showing an actin-normalised induction ratio of 30.1 is found (data not shown). 
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For the gene shown in Table 6 and Figure 4 it can- be seen that in cells transduced by the control 
adenovirus Ad Apt ires-GFP, there is a negative response to hypoxia (conditions 4,5) as compared to in 
normoxia (conditions 2,3). However, this response is significantly greater when the natural hypoxia 
response is amplified by overexpression of HIF1 alpha or EPAS 1 (conditions 6,7,8,9). 

5 Example 2: Differences in the hypoxia responses of resting and activated macrophages. 

Macrophages accumulate at hypoxic areas in various disease states, including cancer, rheumatoid 
arthritis, atherosclerosis and wound healing. At these sites macrophages activation is liable to occur, such 
as in response to T-cell derived gamma interferon. For instance, in atherosclerotic plaques there is an 
accumulation of both T-cells and macrophages, and these are known to interact with one another 
10 (reviewed in Lusis AJ, Atherosclerosis. Nature. 2000 Sep 14;407(6801):233-41). 

It is well established that the macrophage has a significant role in the pathology of the above diseases 
involving hypoxia, and that most functions of the macrophage (including inflammatory functions) are 
greatly increased following activation. Therefore any therapeutic strategy aimed at the hypoxic 
macrophage, needs to also consider the effects of macrophage activation and possible cross talk between 
15 the responses to macrophage activation and hypoxia. 

2.1: Research Genetics Human GeneFilters 

This work was carried out using Research Genetics Human GeneFilters, which contain DNA derived 
from clones of the IMAGE cDNA collection, representing genes of varying degrees of characterisation. A 
series of 6 arrays of human genes were used (GeneFilters GF200-205), potentially covering a total of 
20 31,104 genes. Generally, single genes are represented only once in these arrays. However, sometimes 
IMAGE clones initially thought to represent separate genes, upon re-analysis were found to be different 
regions of the same gene. Here we have presented data for all clones individually, though they possess the 
same UniGene ID and gene name. An example is Hypothetical protein FLJ20037. 

The methodology for Research Genetics arrays is similar in principle to that described for the array 
25 screening of clones from subtracted libraries. There are several attributes to this method: Relatively small 
amounts of RNA can be labelled to make cDNA probes, in a single step reaction, and probes are labelled 
with the same chemical group (33P), so there are no errors introduced as a result of using different dyes, 
which may differ in stability etc. Using a Phosphorimager allows detection over a wide range of 
intensities (over 4 logs). Overall it is interesting to note a recent study, which has favourably re-evaluated 
30 the performance of the nylon based array, as compared with the glass-based microarray method (Berlucci 
F el of, Hum Mo/ Genet 8:1715-1722 (1999)). 
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Experiments were done essentially as described in the Research Genetics GeneFilters protocol. Duplicate 
copies of each array from the same production batch, were used and hybridised in parallel with labelled 
RNA isolated from normoxic and hypoxic primary human macrophages. Hybridised arrays were scanned 
twice using a Molecular Dynamics Storm phosphorimager, and both images were analysed to ensure 
5 reproducibility. Furthermore, the experiments were repealed using the same RNA samples, but with 
different array lot numbers, again to ensure reproducibility. 

Analysis was performed using Research Genetics Pathways software, with normalisation using the 'all 
data points* option. Analyses were output as spreadsheets and filtered to remove data points where the 
signal intensity was less than 4-fold above the general background for the experimental condition with the 
10 higher signal (hypoxia or normoxia depending on whether hypoxia causes induction or repression). 
Sometimes expression in the lower state was not significantly above background, and the ratio will 
therefore be underestimated. Ratios were calculated by normalised signal intensity in hypoxia divided by 
normoxia. Changes were verified visually from the original array images. 

In this manner, comparisons were made between normoxia and hypoxia in resting macrophages. The 
15 whole procedure was then repeated for activated macrophages, to investigate possible differences in the 
response to hypoxia. It is possible that potential differences for certain genes could be correlated with 
changes in expression resulting from activation, prior to challenge with hypoxia. To explore this 
possibility, comparisons were made between resting and activated macrophages, both in normoxia. Since 
some of the genes we have identified as being activated by hypoxia have very low hybridisation signals in 
20 normoxia (for both resting and activated macrophages), this comparison was not possible. 

We have found various patterns of gene expression changes occurring in response to hypoxia, related to 
the activation state of macrophages, which are presented below. Such a range of responses, specific to 
various subsets of genes, was not expected, and contradicts a view that the hypoxia response is a largely a 
generic mechanism. 

25 Table 7 shows genes that are induced by hypoxia to a similar degree in resting and activated 
macrophages. 

Table 8 shows genes that are induced by hypoxia to a greater degree in resting macrophages, as compared 
to activated macrophages. These data are presented illustratively in Figure 5. 

Data from Table 8/Figure 5 reveals several unexpected observations. 

30 A) From the final column it can be seen that macrophage activation in the absence of hypoxia, causes 
induction of many of these genes. This suggests that the signalling pathways resulting from 
activation and hypoxia might converge to a single transcriptional regulator, causing macrophage 
activation to pre-empt the response to subsequent hypoxia. This is exemplified most strikingly for 
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Interleukin 8, which is dramatically induced in response to macrophage activation, but shows no 
additional response to hypoxia. 

B) Genes in rows II, 13 and 14 have no response to hypoxia following macrophage activation, 
though there is not a preceding large increase in expression in response to macrophage activation 

5 alone. This suggests that in the activated macrophage, the necessary signalling pathway or 

transcriptional regulator is not functional. 

C) Although Table 8 was produced electronically, without selecting genes based on their names, it can 
be seen that genes encoding proteins of the metallothionein family feature strongly. 

Table 9 shows genes which are induced by hypoxia to a greater degree in activated macrophages, 
10 compared to resting macrophages. These data are presented illustratively in Figure 6. 

In Table 7, there are several genes for which hypoxia/ normoxia ratios were only obtained for activated 
macrophages, such as Cox-2 (see row 47). For these genes, macrophage activation usually increases 
expression of the gene to detectable levels, thus allowing the study of subsequent changes in response to 
hypoxia. It is likely that these genes are not significantly expressed in resting macrophages irrespective of 
15 hypoxia, and therefore the hypoxia response is probably specific to activated macrophages. 

Certain genes respond to hypoxia by decreasing mRNA expression (repression), and these genes therefore 
have hypoxia/normoxia ratios of < 1 .0. This phenomenon is known in the field of hypoxia, although the 
mechanism is obscure. Data is presented in tables 7-9, which unexpectedly shows that this hypoxia- 
induced repression for specific genes is not a generic process, but is dependent on the cellular context. In 

20 Table 10/ Figure 7 } genes are presented that are hypoxia-repressed to a greater degree in activated 
(column 7) compared with resting (column 8) macrophages. Prior to any hypoxic challenge, these gene 
are induced to varying degrees, in response to macrophage activation (column 9), suggesting a shared 
mechanism for these separate responses. From Table 10, genes in rows 1-6 show that macrophage 
activation is necessary to obtain any response to hypoxia. In resting macrophages, these genes are not 

25 responsive to hypoxia at all. 

Strikingly, Table 10/ Figure 7 shows that seven separate genes encoding chemokine proteins (Monocyte 
chemotactic protein 1, Macrophage inflammatory protein lb, Monocyte chemotactic protein 3 and Small 
inducible cytokine A3, Monocyte chemotactic protein 2, Macrophage inflammatory protein 2a and 
Macrophage inflammatory protein 2 precursor) are more strongly repressed in activated macrophages as 
30 compared to resting macrophages. These genes are also among the most inducible in response to 
activation alone, in normoxia (column 9). These findings are of potential utility in view of the great 
significance of chemokines to inflammatory disease. For example, macrophage chemotactic factor 1 
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(Table 10, row 19) is key to the pathological role of the macrophage in atherosclerosis ("Chemokines and 
atherosclerosis" Reape TJ and Groot PHE, Atherosclerosis 147: 213-225, 1999). 

Genes in rows 20-30 of Table 10, were not detectably expressed in resting macrophages, irrespective of 
hypoxia. Table 1 1 shows other genes that were down-regulated in response to hypoxia in macrophages. 

5 Example 3: Tissue-specific hypoxia regulation of gene expression by an analysis of a series of 
primary human cell cultures. 

Equivalent cultures of non-immortalised, non-transformed primary human cells of 10 distinct types, were 
cultured in either normoxia or were exposed to hypoxia for 6 hr and 18 hr, and gene expression changes 
were determined. To the inventors' knowledge, this is the first time that such a study has been reported. 
10 Moreover, unlike the vast majority of information in the public domain relating to genes responsive to 
hypoxia, all of these cells were human and were cultured without any modifications following isolation 
from the human donors. By using primary ceils rather than cell lines or immortalised cultures, the 
findings of this work more accurately represents the situation in the human body. 

Most cell types were obtained from Clonetics (distributed by BioWhittaker, Walkersville, MD) and 
15 cultured according to the manufacturer's recommendations, unless where otherwise shown. #I:adipocyte 
(Clonetics CC-2568; derived from subcutaneous adult adipose tissue), #2:cardiomyocyte (Clonetics CC- 
2582; derived from fetal tissue; prior to experimentation cultured in minimal medium: DMEM, 4% Horse 
serum), #3 rendothelial (TCS CellWorks ZHC-2101 human umbilical vein endothelial cells), #4:fibroblast 
(Clonetics CC-2511 dermal fibroblasts derived from adult tissue), #5:hepatocyte (Clonetics CC-2591, 
20 derived from adult tissue), #6:macrophage (derived from human blood as described elsewhere in the 
specification), #7:mammary epithelial (Clonetics CC-255I; derived from adult tissue), #8:monocyte 
(derived from human blood as described elsewhere in the specification but without the 7 day 
differentiation culture period), #9:neuroblastoma (neuroblastoma-derived cell line SH-SY5Y), #10:renal 
epithelial (Clonetics CC-2556; derived from fetal tissue), #11 skeletal muscle myocyte (Clonetics CC- 
25 2561; derived from adult. tissue). A non-primary cell type (#9) was used to represent neurons, since 
primary human neurons are difficult to source. Therefore a total of 11 cell types are compared. It should 
be noted that RN A from hepatocytes at the I6hr timepoint of hypoxia was not available for this work. 

Genes which were induced or repressed preferentially in particular cell type(s) were identified by 
hybridisation of the RNA samples to the custom gene array, as described in Examples lb and lc. Each 
30 RNA sample was hybridised to duplicate or triplicate arrays, to ensure reproducible data, and was 
analysed using GeneSpring software. Data from replicate arrays were merged during analysis to generate 
mean values. Data normalisation was achieved per-array using the aforementioned list of control genes, 
such that differences in RNA labelling or hybridisation due to experimental variation were corrected by 
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referencing each gene to (he mean value of the reference genes on the same array. Also, for each gene, 
expression values were obtained which represent the value in each experimental condition (e.g. 
macrophages 6hr hypoxia) as compared to the median of value of that gene throughout the full range of 
experimental conditions (i.e. from all cell types). This transformation does not alter the relative values of 
5 any gene between the different experimental conditions, and is done since these is no obvious single 
reference experimental condition to create ratio values. This is common in microarray data analysis. 

Table 12 shows the full dataset of this analysis. From this it can be seen that certain genes respond to 
hypoxia differently, depending on the particular cell type. This information is valuable in identifying 
biological targets for the development of therapeutic and diagnostic products. Not only does it indicate a 
10 particularly significant role for these genes in the specific cell type implicated in a disease, but it also 
identifies that any therapeutic product is less likely to produce problematic toxicological effects. Data 
shown in Table 12 and the derived figures, are reproducible, and are an accurate determination of mRNA 
expression levels. This may be confirmed by independent means, such as quantitative real time RT-PCR. 

Certain genes from Table 12 will be presented for illustration. 

15 Genes with a greater response in monocytes or macrophages 

Since monocytes and macrophages are similar cell types, the latter derived from the former, they will be 
analysed logether. 

Expression profiles of 11 genes showing hypoxia-induced changes in gene expression which are most 

pronounced in monocytes or macrophages are shown in Figures 8-18. These genes correspond to: 
20 SeqID:339/340 CYP1 (cytochrome P450, subfamily XXVIIB) 

Seq ID:357/358 interleukin 1 receptor antagonist 

Seq ID:375/376 Regulator of G-protein signalling 1 

Seq ID:389/390 GM2 ganglioside activator protein 

Seq ID:405/406 Alpha-2-macrogIobuiin 
25 Seq 10:475/476 Ecotropic viral integration site 2A = 

Seq ID:433/434 high affinity immunoglobulin epsilon receptor beta (CFFM4) 

Seq 10:431/432 Pleckstrin 

Seq 10:469/470 cytokine effector of inflammatory response SCYA3L 
Seq 10:79/80 Novel PI-3-kinase adapter 
30 Seq 10:21/22 Hypothetical protein PRO0823 

It will be appreciated that the majority of these genes have a known biological function in immunity/ 
inflammation, consistent with the known function of the monocyte/ macrophage. Further to this 
knowledge, this data identifies that in hypoxic disease sites where monocyte/ macrophages make up a 
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significant proportion of the cell types, such as in rheumatoid arthritis synovial membranes, that these 
genes are possible therapeutic targets. 

Ecotropic viral integration site 2A (Seq ID:475/476) 

For example, the gene illustrated in Figure 8, Ecotropic viral integration site 2A (Seq ID:475/476) is 
5 induced in hypoxic monocytes to a level over 25 times higher than the median expression level of this 
gene throughout the other cell types. This gene, of unknown function, is located on Chromosome 17ql 1.2 
close to genes with immune functions. Presented elsewhere in this specification is data showing that 
expression of Ecotropic viral integration site 2A is downregulated in response to the inflammatory 
cytokine interferon gamma. These novel data provide evidence that Ecotropic viral integration site 2A is a 
10 novel target for inflammatory conditions involving hypoxia and monocytes. 

Novel PI-3-kinase adapter Seq ID:79/80 Clone p3E9 (EST accession R62339). 

Another example, in Figure 9a, is Seq ID:79/80 (EST accession R62339). It is seen that in hypoxic 
macrophages, this gene is expressed at 6-fold higher levels than the median expression level of this gene 
throughout the other cell types. Therefore, the levels of the encoded protein in hypoxic monocytes/ 
15 macrophages, as found at various disease sites, are likely to be higher than in other cell types not involved 
in the disease process or present at the site of disease. This illuminates a novel utility of this gene as a 
target for the development of therapeutic products for diseases involving monocytes/ macrophages and 
hypoxia. 

The data that led to the generation of this Figure are as follows: 
20 Cell type Oxygen Normalised expression 

(clone plE9/SeqID:79/80) 





adipocyte 


normoxia 


1.54 




adipocyte 


hypoxia 6hr 


0.89 




adipocyte 


hypoxia 18hr 


1.48 


25 


cardiomyocyte 


normoxia 


1.18 




cardiomyocyte 


hypoxia 6hr 


1.80 




cardiomyocyte 


hypoxia 1 8hr 


1.53 




endothelial 


normoxia 


0.68 




endothelial 


hypoxia 6hr 


0.82 


30 


endothelial 


hypoxia 18hr 


0.60 




fibroblast 


normoxia 


0.60 




fibroblast 


hypoxia 6hr 


0.64 




fibroblast 


hypoxia 18hr 


0.73 




hepatocyte 


normoxia 


0.92 


35 


hepatocyte 


hypoxia 6hr 


1.62 




macrophage 


normoxia 


4.20 




macrophage 


hypoxia 6hr 


3.97 




macrophage 


hypoxia 18hr 


6.19 




mammary epithelial 


normoxia 


0.25 


40 


mammary epithelial. 


hypoxia 6hr 


0.42 
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mammary epithelial 


hypoxia 18hr 


0.18 


monocyte 


normoxia 


2.33 


monocyte 


hypoxia 6hr 


3.63 


monocyte 


hypoxia I8hr 


5.01 


5 neuroblastoma 


normoxia 


0.93 


neuroblastoma 


hypoxia 6hr 


0.80 


neuroblastoma 


hypoxia 1 8hr 


0.85 


renal epithelial 


normoxia 


0.57 


renal epithelial 


hypoxia 6hr 


0.61 


10 renal epithelial 


hypoxia I8hr 


0.61 


skeletal myocyte 


normoxia 


1.58 


skeletal myocyte 


hypoxia 6hr 


1.37 


skeletal myocyte 


hypoxia 18hr 


1.17 



15 To subsiantiate the array-based data, the same RNA samples were examined by real time quantilative RT- 
PCR. The advantages of this method are that it is more sensitive and because two gene-specific primers 
are used, the data will be more specific to the gene in question. 

RNA from the above samples (except for the hepatocyte RNA which was unavailable) was Dnase I- 
treated prior to reverse transcription to remove possible contaminating genomic DNA and was reverse 
20 transcribed using an oligo dT ( | 5 ) primer and Superscript II reverse transcriptase. These samples were 
used as template for PCR reactions using primers specific to EST accession R62339 or to beta-actin. 
Primer sequences were as follows: 

Novel Pl-3-kinase adapter Seq W:79/80 Clone pJE9 (EST accession R62339). 
Forward Primer 5' GCC CTT AGT TTT TCA CTT CTT CGT 3' 
25 Reverse Primer 5' CCT TA A GAT CCA TTC TCA TTG CTG AT 3' 
Beta Actin 

Forward Primer 5' GCC CTG AGG CAC TCT TCC A 3 
Reverse Primer 5' GCG GAT GTC CAC GTC ACA 3' 

All RT-PCR reactions were performed using an ABI Prism 7700 Sequence Detector system. For each Q- 
30 PCR run, a master mix was prepared with 2x SYBR Green I master mix (Applied Biosystems) and 
primers at 5\\M. Two microlitres of respective diluted cDNA were added to PCR master mixture, 
amounting to 25|nL. The thermal cycling conditions comprised 50°C for 2 minutes, 95°C for 10 minutes, 
40 cycles at 95°C for 15 seconds, and 60°C for 1 minute. PCR reactions were set up in 96 well format 
with duplicate amplifications for each data point including 8 serial cDNA dilutions (0.2, 0.1, 0.05, 0.025, 
35 0.01, 0.005, 0.001 and 0.0001) of macrophage treated with 18 hours hypoxia to compose a standard 
curve, a no template control, no amplification control lacking reverse transcriptase, and each cDNA 
sample at a dilution value of 0.1. The experiment for the novel PI3K adapter was carried out in triplicate 
for reproducibility which were later determined by linear regression analysis. Data was analysed with 
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necessary adjustment of the default baseline and threshold line using ABI Prism 7700 software. The C t 
value, an important raw data for each sample, was calculated as the cycle number at which the ARn 
crosses the baseline. For each run, a standard curve was constructed by plotting a graph with mean C, 
values from 8 data points from standard sample against log input of the corresponding dilution values 
5 with a best fit trend line. From the trend line, the formula 'y=mx+c' was created according to the y- 
intercept and slope of standard curve which then were used for calculating the log input amount of the 
experimental cDNA samples, as related to the calibration sample. Data for the Novel Pl-3-kinase adapter 
was normalized to that of beta-actin to correct for potential differences in efficiency of cDNA synthesis 
between the RNA samples. 

10 From the TaqMan data the specificity to monocytes and macrophage found from the array data is 
confirmed and found to be even more pronounced (see Figure 9b). The data presented in the Figure are 
listed below. In the data listed below, the normalized expression values are multiplied by 1000 for clarity. 
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adipocyte 


normoxia 
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adipocyte 


hypoxia 6hr 


0.007 




adipocyte 


hypoxia 18hr 


0.015 




cardiomyocyte 


normoxia 


0.163 




cardiomyocyte 


hypoxia 6hr 


0.037 
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cardiomyocyte 


hypoxia 18hr 


0.222 




endothelial 


normoxia 


3.093 




endothelial 


hypoxia 6hr 


0.059 




fibroblast 


normoxia 


0.527 




fibroblast 


hypoxia 6hr 


0.043 
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fibroblast 


hypoxia 18hr 


0.037 




macrophage 


normoxia 
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macrophage 


hypoxia 6hr 
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hypoxia 18hr 


! 162.056 




mammary epithelial 
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hypoxia 6hr 
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hypoxia 18hr 
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monocyte 
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hypoxia 6hr 
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hypoxia 18hr 
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There are several technical reasons why the results from the array-based data might be more pronounced 
in the Taqman results - the lower sensitivity of the array-based method. means that genes which are not 
expressed will be detected as a background signal. Also the array method is more likely to suffer from 
cross-hybridisation between similar genes. 

5 The TaqMan data illustrates dramatically the concept that the hypoxia response is not just a generic 
response found in all cell types, relating to generic cell processes such as metabolism. 

Database searches for gene sequences showing identity with IMAGE clone acc:R62339 reveal that there 
are no matching human sequences of any type other than ESTs. This includes full length cDNAs, 
truncated cDNAs, gene sequences from chromosomal data or hypothetical protein gene sequences. 
10 Therefore the human gene represented by IMAGE clone acc:R62339 is a novel human gene. 

Although this human EST is unannotated, by comparison with mouse sequence data (acc AF293806), it 
appears likely to encode a novel human Phosphoinositol 3-kinase (PI3-kinase) adapter molecule, 
homologous to the recently described mouse gene, BCAP. This class of molecule, involved in 
intracellular signalling, have been shown to have utility as a drug target (see Stein RC et a/, "PI3-kinase 
15 inhibition: a target for drug development" MolMed Today. 2000 Sep;6(9):347-57). PI3-kinases are key to 
many cellular processes relevant to human disease, including proliferation, apoptosis and inflammation. 
The data presented for the gene encoded by Seq ID:79/80 provides evidence that the encoded protein is a 
novel drug target in humans, specifically targeting monocyte/ macrophages at hypoxic disease sites. 

In the publication relating to murine BCAP, the protein is identified as an adapter molecule connecting 
20 the non-receptor protein tyrosine kinase Syk to the p85 subunit of PI3-kinase, and therefore to the pivotal 
signalling pathways centred around PI3-kinase (Okada T et al "BCAP: the tyrosine kinase substrate that 
connects B cell receptor to phosphoinositide 3-kinase activation." Immunity. 2000 13:817-27). Although, 
in this report, Syk is acting as the intracellular signalling component of the B cell antigen receptor, which 
is present exclusively on B -cells, Syk has been shown to initiate intracellular signalling from other cell 
25 surface receptors which are expressed on macrophages, including the Fc gamma receptor, the chemokine 
receptor CCR5 and rriacrophage-expressed CD8 (Darby C et al "Stimulation of macrophage Fc gamma 
R III A activates the receptor-associated protein tyrosine kinase Syk and induces phosphorylation of 
multiple proteins including p95Vav and p62/GAP-associated protein". J Immunol 1994 152:5429-37) 
(Kedzierska K et al "FcgammaR-mediated phagocytosis by human macrophages involves Hck, Syk, and 
30 Pyk2 and is augmented by GM-CSF." J Leukoc Biol. 2001 Aug;70(2):322-8.), (Ganju RK et al "Beta- 
chemokine receptor CCR5 signals through SHP1, SHP2, and Syk." J Biol Chem, 2000 275:17263-8.), 
(Lin TJ et al "Activation of macrophage CD8: pharmacological studies of TNF and IL-1 beta 
production." J Immunol. 2000 164:1783-92.). 
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Indeed, syk has been validated as target in macrophages to inhibit inflammatory activities of this cell type 
(Stenton GR et al "Aerosolized Syk antisense suppresses Syk expression, mediator release from 
macrophages, and pulmonary inflammation." J Immunol 2000 Apr l;164(7):3790-7.). 

Additional to the finding that the probable human orthologue of the adapter molecule BCAP is 
5 preferentially hypoxia-induced in human monocytes/ macrophages, we also find from data generated by 
the custom array, that the protein acting immediately upstream of BCAP (i.e. Syk) is also regulated by 
hypoxia in this novel cell type specific manner, greatly increasing the biological significance of the 
original finding (see Figure 9c). The data used to generate this Figure are presented below for clarity. 
Cell type Oxygen Normalised expression 
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(of svk) 




adipocyte 


normoxia 


2.6573591 




adipocyte 


hypoxia 6hr 


1.499927 




adipocyte 


hypoxia 18hr 


1.1 1 15488 
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cardiomyocyte 
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cardiomyocyte 


hypoxia 6hr 
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cardiomyocyte 
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endothelial 
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endothelial 


hypoxia 6hr 
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fibroblast 


normoxia 


0.8542026 




fibroblast 


hypoxia 6hr 


0.7657573 




fibroblast 


hypoxia 18hr 


0.784982 




hepatocyte 


normoxia 


0.5238476 
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hepatocyte 


hypoxia 6hr 


0.8465495 




macrophage 


normoxia 


4.272981 




macrophage 


hypoxia 6hr 


6.144931 




macrophage 


hypoxia 18hr 


10.278416 




mammary epithelial 


normoxia 


1.1023632 
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mammary epithelial 


hypoxia 6hr 


2.7382789 




mammary epithelial 


hypoxia 18hr 


0.7985004 




monocyte 


normoxia 
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hypoxia 6hr 


8.6809225 




monocyte 


hypoxia 18hr 


1 1.58468 
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neuroblastoma 


normoxia 


1.0230793 




neuroblastoma 


hypoxia 6hr 


1.089154 




neuroblastoma 


hypoxia 18hr 


0.7689335 




renal epithelial 


normoxia 


0.88565326 




renal epithelial 


hypoxia 6hr 


1.2609364 


40 


renal epithelial 


hypoxia 18hr 


0.6242461 




skeletal myocyte 


normoxia 


1.3959162 




skeletal myocyte 


hypoxia 6hr 


0.91255134 




skeletal myocyte 


hypoxia 18hr 


0.64795935 



45 In summary, we have shown here that a novel human gene encoding a predicted signalling protein 
. relevant to human disease is activated by hypoxia, specifically in monocytes and macrophages. This data 
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is validated by non-array based means. Furthermore, we identify the protein immediately upstream of this 
signalling system as being co-regulated in this manner too. Therefore the human PI3-kinase adapter 
encoded by IMAGE clone acc: R62339 and the non-receptor tyrosine kinase Syk are both identified here 
for the first time as therapeutic targets for diseases involving hypoxic macrophages, including 
5 Rheumatoid arthritis, chronic occlusive pulmonary disease, atherosclerosis and cancer. Because both 
genes are preferentially expressed in hypoxic macrophages, toxicity effects of therapeutic products 
directed at the encoded proteins are likely to be limited. 

As discussed in detail above, fragments and functional equivalents of the PI-3-kinase adapter protein 
represented in Seq ID :79/80 and other equivalent proteins are included within the present invention, in 
10 addition to ligands that bind specifically to these proteins. Furthermore, the invention also embraces 
purified and isolated nucleic acid molecules encoding these proteins, fragments and functional 
equivalents, vectors containing such nucleic acid molecules and host cells transformed with these vectors. 

Regulator of G-protein signalling I (Seq ID:375/376) 

Another intracellular signalling protein, Regulator of G-protein signalling 1 (RGS1; Seq ID:375/376), in 
15 shown in Figure 1 0. Here the expression levels in the hypoxic monocyte is 30-fold higher than the median 
expression level of this gene throughout the other cell types. The function of this protein is to negatively 
regulate G protein signalling pathways, and inhibit chemokine-induced cell migration of immune cells 
(Moratz C et al J Immunol, 2000 164:1 829-38 and Denecke B et al J Biol Chem. 1999 274:26860-8.). 

Our data suggests that this gene is preferentially expressed in macrophages, consistent with the findings 
20 of Denecke B et al (J Biol Chem. 1999 274:26860-8.). Our novel finding that expression is even further 
enhanced by hypoxia illuminates a mechanism by which cell migration is inhibited in hypoxia, leading to 
an accumulation of these cells at pathological sites of hypoxia. This mechanism is novel and distinct to 
other mechanisms proposed in the art to explain this key aspect of hypoxia and inflammation (for 
example: Grimshaw MJ et al "Inhibition of monocyte and macrophage chemotaxis by hypoxia and 
25 inflammation-a potential mechanism." Eur J Immunol. 2001 31:480-9). 

Furthermore, Figure 10 shows that Regulator of G-protein signalling 1 is upregulated during 
differentiation of monocytes to macrophages, with significance to changes in cell motility. This discovery 
therefore provides that inhibitors of RGS1 have utility in increasing the motility of macrophages that are 
used for cell-based therapies. Accordingly, one embodiment of this aspect of the invention provides for 
30 the use of an inhibitor of RGS1 in therapy, by increasing the motility of macrophage cells. 

GM2 ganglioside activator protein 

The gene shown in Figure 11, GM2 ganglioside activator protein, was originally characterised as a 
lysosomal co-factor required for degradation of gangliosides. It has been proposed to have alternative 
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roles as a stcrzied protein, and can bind and inhibit the actions of the inflammatory mediator, platelet 
activating factor (RigatB eta] Biochem Biophys Res Commun. 1999 258:256-9.). 

Our novel finding, presented in Figure 11, shows that GM2 ganglioside activator protein is induced by 
hypoxia, preferentially in macrophages, suggesting an influence on the inflammatory functions of the 
5 macrophage in hypoxia. 

In Figures 15-18, genes are shown which are expressed preferentially in the monocyte/ macrophage, but 
which are decreased in expression in response to hypoxia. Being expressed at highest levels in the 
monocyte/ macrophage, these genes are more likely to be significant to the biological functions of this 
cell type. 

10 Interleukin 1 receptor antagonist (Seq lD:357/358) 

In Figure 15, the gene interleukin I receptor antagonist (Seq ID;357/358) is seen to be down-regulated by 
hypoxia in the macrophage. Since the function of the encoded protein is anti-inflammatory, then down- 
regulation of this gene would be expected to have a pro-inflammatory effect. Therefore, corrective 
expression of the gene, would be expected to produce therapeutic effects in inflammatory disorders 
15 involving macrophages and hypoxia, such as Rheumatoid Arthritis (Hollander AP et al Arthritis Rheum. 
2001 44:1540-4). This correlates with effects seen from the application the drug Anakrina / Kineret™ 
developed by Amgen. This supports the applicability of the genes disclosed herein as novel targets for 
therapeutic products. 

The example of gene interleukin 1 receptor antagonist also provides good exemplification of the concept 
20 that different cell types respond to hypoxia differently. Here, not only are there quantitative differences, 
but also qualitative differences in that this gene is <Wrt-regulated by hypoxia in macrophages, but up- 
regulated by hypoxia in several other cell types, such as renal epithelial cells (see Figure 15). Such 
findings are not documented in the art. 

The dataset of Table 12 also contains genes which are induced preferentially in monocyte/ macrophages 
25 and also in some but not all other cell types tested. Several of these genes are present as multiple clones 
on the gene array, giving separate data, therefore adding extra confidence to the conclusions. These genes, 
presented in Figures 19-28 correspond to: 

SeqID:3 13/3 14 adipophilin 

SeqlD:163/164 Hypothetical protein FLJ1351 1 

30 SeqID:267/268 Osteopontin 

SeqlD:17/18 Hematopoietic Zinc finger protein 

SeqID:137/138 CYP1B1 

SeqID:325/326 CYP1B1 
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It will also be seen that in the case of CYPIB1 (clones p!F16 and plE3) the hypoxia response in 
monocytes / macrophages is qualitatively different to the other cell types tested, in that the gene is up- 
regulated rather than down-regulated in response to hypoxia. 

Genes with a greater response in endothelial cells 

5 The dataset of Table 12 also contains genes which are induced preferentially in endothelial cells, a cell 
type key to the process of angiogenesis, in response to hypoxia. These genes are as follows, and are 
presented in Figures 29-31: 

SeqID:205/206 Hypothetical protein FLJ22690 
SeqlD:65/66 cDNA DKFZp586E1624 
10 SeqID:197/198 EST 

Genes with a greater response in hepatocytes 

The dataset of Table 12 also contains genes which are induced preferentially in hepatocytes, in response 
to hypoxia. These genes are presented in Figures 32a and 33-38. It is noted that most of these genes, 
including hqp0376, encode proteins of the metallotheionein family. Furthermore, close inspection of these 
15 data reveals that the fold induction in hypoxia compared to normoxia for monocyte/ macrophages are 
very high, though the absolute levels of expression are below that of hepatocytes. 

SeqlD:85/86 EGL nine (C.elegans) homolog 3 

SeqID:83/84 Novel Meiallothionein 

SeqID:337/338 Hypothetical protein.hqp0376 (a metallotheionein) 

20 SeqlD:265/266 Metallothionein 2A 

SeqID:243/244 Metallothionein 1G 

SeqID:141/142 Hepcidin antimicrobial peptide 

SeqID:239/240 Metallothionein IH 

EGL nine (C.elegans) homolog 3 

25 As described above, it has been discovered that a polypeptide encoded by a gene identified from the EST 
recited in SEQ ID No 86, having the Protein accession number BABI5101 (encoded by Homo sapiens 
cDNA: FLJ21620 fis, clone COL07838 Nucleotide accession AK025273) is regulated by hypoxia. Other 
public domain sequences corresponding to this gene include Homo sapiens cDNA: FLJ23265 fis, clone 
COL06456 Nucleotide accession AK026918. Accordingly, when referring in the present specification to 

30 the EST recited in SEQ ID No 86, it is intended that these gene and protein sequences are also embraced. 
This gene was identified using Research Genetics Human GeneFilters arrays, which contain an EST 
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corresponding to the gene (accession number R00332). The gene is now termed EGL nine (C.elegans) 
homolog3. 

There are no reports that describe the function of this human gene. However, a high degree of amino acid 
homology is observed between the protein encoded by this gene, and a rat protein called "Growth factor 
5 responsive smooth muscle protein" or "SM20" (Nucleotide accession U06713; Protein accession 
A53770). An alignment of single letter amino acid sequences is shown below. Over the highlighted 
region there is 97% amino acid similarity and 96% amino acid identity. 



A53770 ( 1 ) MTLRSRRGFLSFLPGLRPPRRWLRISKRGPPTSHWASPALGGRTLHYSCR 

10 BAB15101 (1) — 

51 100 
A5377 0 ( 51 ) SQSGTPFSSEFQATFPAFAAKVARGPWLPQVVEPPARLSASPLCVRSGQA 

BAB15101 (1) 

101 150 

15 A53770 (101) LGACTLGVPRLGSVSETOLGH€ 



BAB15101 (1) 

151 — ■ - ■ — — - - — 

A53770 (151) j^EW^ 

BAB 15 101 (35) t^E^GDGVLER 

20 201 "" " '""^ ' " "1 — — — J! 250 

• A53770 (201) j§^EA]^ 

BAB1 5101 (85) ij^E^ 

251 J " " T L ' " ~~ J66 

A53770 (251) ji^l^G^ 
' 25 BAB15101 (135) ^NPJ^^ 

A53770 (301) pi'FSjWS^ 
BAB15101 (185) 

351 



30 A53770 (351) 

BAB15101 (235) L ! ™ ! 



The high degree of amino acid similarity. suggests that the human protein BAB 15101 has an equivalent 
biochemical function to the rat protein A53770 ("Growth factor responsive smooth muscle protein" or 
35 "SM20"). Recent publications have shown that SM20 functions to promote apoptosis in neurons 
(Lipscomb et al } J Neurochem 1999; 73(1 ):429-32; Lipscomb et aL, J Biol Chem 2000 Nov 1; [epub 
ahead of print]). Significantly, SM20 has been shown to be expressed at high levels in the heart (Wax et 
al, J Biol Chem 1994; 269(17): 13041-7). 

It has also been discovered that a polypeptide encoded by a gene identified from the EST recited in SEQ 
40 ID No 90, having the Protein accession number CAB81622, is regulated by hypoxia. The encoding 
human gene has been annotated in the UniGene database as "Similar to rat smooth muscle protein SM- 
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20"; the nucleotide sequence is contained within the nucleotide accession AL1 17352. More recently, a 
longer fragment of this gene has been cloned, named clorfl2, or EGLNI (Nucleotide accession 
AAG34568; Protein accession AAG34568). Accordingly, when referring in the present specification to 
the EST recited in SEQ ID No 90, it is intended that these gene and protein sequences are also embraced. 

5 This distinct human gene, encoding a protein related to SM20 and EGLN3 (BAB 15101), is also induced 
in response to hypoxia. This gene was identified using Research Genetics Human GeneFilters arrays, 
which contain an EST corresponding to the gene (accession number H56028). 

. Independently to this, a fragment of this gene has been cloned from a cDNA library derived from hypoxic 
human cardiomyoblasts, and it has been shown that the gene is increased in expression in response to 
10 hypoxia in this cell type (see Table 1 herein; penultimate row). The nucleotide sequence of this cDNA 
fragment is referred to herein as SEQ ID No 90a. 

In the light of this novel discovery reported herein that these human equivalents of SM 20 are induced by 
hypoxia, it is herein proposed that in cardiac ischaemia, the resulting apoptosis is due at least in part, to 
increased expression of these genes. The therapeutic modulation of the activity of EGLN3 (BAB 15101), 

15 clorfl2 (AAG34568), CAB81622, SM20 and other equivalent proteins and encoding genes therefore 
provides a novel means for the treatment of myocardial ischaemia, through the alteration of the 
propensity of myocardial cells to undergo apoptosis. For example, a suitable treatment may involve 
altering the susceptibility of ischaemic myocardial tissue to subsequent reperfusion and re-oxygenation, 
or may involve modulating the susceptibility of chronic ischaemic myocardial tissue (including forms of 

20 angina) to later more severe ischaemia, which would result in myocardial infarction. It is submitted that, 
by way of analogy, cerebral ischaemia may be treated using the same principle. 

Although the Applicant does not wish to be bound by this theory, the downstream effects of SM20 and 
related genes such as EGLN3 (BAB15101), clorfl2 (AAG34568), and CAB81622, namely, apoptosis 
and angiogenesis might be explained as follows. The apoptotic effect of NGF withdrawal may be 

25 mediated by induction of the hypoxia pathway, but may be an aspect of the supposed involvement of the 
MF protein in the stress response. HFla is induced by reactive oxygen species (see Richard et al. J Biol 
Chem 2000 Sep 1;275(35):26765-71). This could, in turn, be mediated by over-load of the proteosomal 
pathway for HlFla degradation and the consequent accumulation of undegraded HFla. Accordingly, it 
is considered that modulation of SM20 and the related genes EGLN3 (BAB15101), clorf!2 

30 (AAG34568), and CAB81622 may have applications in the treatment of diseases resulting from 
disturbances in proteosome function, such as prion diseases and other neurodegenerative diseases. 

These data provide the first connection between these related genes and the physiological response to 
hypoxia. Recently published research papers have identified that the protein products of these genes can 
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act as proline-hydroxylases (see Bruick RK et al Science. 2001 294:1337-40 and Epstein AC et al Cell. 
107:43-54). This is consistent with our observations that certain proline hydroxylases are induced in 
response to hypoxia and the genes EGLN1 and EGLN3 are part of the hypoxia response. For example, 
two genes encoding proline hydroxylases have been identified herein as being increased in expression in 
5 response to hypoxia (proline 4-hydroxylase, alpha polypeptide i; SeqID: 231/232, proline 4-hydroxylase, 
alpha polypeptide II; SeqID: 349/ 350). This identified a functional significance of proline hydroxylation 
as a response to hypoxia. 

Proline hydroxylase leads to degradation of HIFla in normoxia (HIF regulates its own degradation - 
feedback). Hydroxylated HDPla + VHL leads to ubquitination and consequent degradation of HIFla by 
10 proteosome. The activity of the prolyl hydroxylase is 0 2 -dependent, so under conditions of hypoxia, 
HIFla is not hydroxylated efficiently and is stabilised. HIFla protein thus accumulates to a high level. 
The hypoxia-induction of the prolyl hydroxylase ensures that when 0 2 concentration returns to normal, 
there is sufficient enzyme available to target this high level of HIF1 a efficiently for rapid degradation. 

Degradation of HIFla is dependent on HIFl-induced transcription (i.e. is hypoxia inducible). Berra et al 
15 (FEBS Lett 2001 Feb 23;491(l-2):85-90) raises the specific hypothesis of an unknown hypoxia-inducible 
factor which targets HIFla for proteosomal degradation. It appears reasonable to propose that this factor 
will clearly be hypoxia-inducible, to ensure that a rapid and effective constraint on the hypoxic response 
would operate on return to normoxia. It now appears as if the genes EGLN 1 and EGLN3 form part of this 
mechanism. 

20 It is also hypothesised that SM20 and the related genes EGLN3 (BAB15101), clorfl2 (AAG34568), and 
CAB8I622 may act as tetramers. Known prolyl hydroxylases such as prolyl 4-hydroxylase (P4H) are 
known to act as tetramers of two alpha subunits and two beta subunits. SM20 and the related genes 
exhibits high similarity to the alpha subunit of P4H and it therefore seems likely that SM20 and the 
related genes are likely to have a binding partner that is equivalent to the beta subunit of P4H. 

25 SM20 has been shown to bind to the transcription factor HIFla, and shares a low level homology with a 
p53 binding protein. P53 is a transcription factor that is known to be involved in apoptosis. Accordingly, 
it is proposed that in addition to binding to HIF1A, SM20 and the related genes EGLN3 (BAB 15101), 
clorfl2 (AAG34568), and CAB81622 may also bind and modify other transcription factors that are 
involved in the hypoxic response such as EPAS and HIF3A, or other transcription factors such as p53 and 

30 thereby influencing apoptosis. This aspect of the invention thus provides dimer and tetrameric forms of 
the EGLN3 (BAB15I01), clorfl2 (AAG34568), and CAB81622 proteins, preferably complexed with a 
protein selected from the group consisting of HIFla, p53 and a protein binding partner that is equivalent 
to the beta subunit of P4H. Preferably, such dimers and tetramers are heterodimers/heterotetramers. 
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To provide further evidence that these related genes are a significant part of the hypoxia response 
additional expression data is presented here. 

Expression profiles for these two genes will be displayed with pre-chip normalisation to correct for 
differences in RNA labelling etc, but within each gene no further normalisation is done (per-gene 

5 normalisation), so the relative absolute expression levels of the two genes can be compared and Y-axis 
units between separate graphs from the same experiment are comparable. These graphs are presented as 

' Figures 32b (clorfl2) and 32c (EGLN3). 

It can be seen from these Figures that both genes (clorf!2 and EGLN3) are inducible in response to 
hypoxia in macrophages whether activated by gamma interferon and lipopolysaccharide or if de-activated 
10 by treatment with interleukin-10. In macrophages the absolute expression level of Clorfl2 appears to be 
higher than EGLN3. 

There is a prejudice in the art that the response to hypoxia is generic to all cell types. Contrary to this, we 
show herein that genes are regulated by hypoxia to a greater degree in certain cell types, substantiating 
their utility in designing specific therapeutic products for diseases involving those cell types. 

15 From Figures 32a and 32d and the data presented below, differing expression profiles of the two related 
genes cI0RF12 and EGLN3 are apparent throughout the 1 1 tested cell types, though Clorfl2 is generally 
expressed at higher levels than EGLN3. 



Cell type 
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Oxygen mRNA expression mRNA expression 

(cIORF12SeqID:89/90)(EGLN3SeqID:85/86) 





adipocyte 


normoxia 


0.0075 


0.0033 




adipocyte 


hypoxia 6hr 


0.0091 


0.0027 




adipocyte 


hypoxia 1 8hr 


0.0182 


0.0025 




cardiomyocyte 


normoxia 


0.0067 


0.0019 


25 


cardiomyocyte 


hypoxia 6hr 


0.0381 


0.0023 




cardiomyocyte 


hypoxia 18hr 


0.0201 


0.0026 




endothelial 


normoxia 


0.0198 


0.0019 




endothelial 


hypoxia 6hr 


0.0583 


0.0033 




endothelial 


hypoxia 18hr 


0.0397 


0.0026 


30 


fibroblast 


normoxia 


0.0119 


0.0032 




fibroblast 


hypoxia 6hr 


0.0260 


0.0046 




fibroblast 


hypoxia I8hr 


0.0235 


0.0040 




hepatocyte 


normoxia 
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0.0080 
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hepatocyte 


hypoxia 6hr 


0.0074 


0.0146 




macrophage 


normoxia 


0.0033 


0.0008 




macrophage 


hypoxia 6hr 


0.0083 


0.0018 




macrophage 


hypoxia I8hr 


0.0058 


0.0021 
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mammary epithelial 


normoxia 


0.0065 


0.0014 




mammary epithelial 


hypoxia 6hr 


0.0137 


0.0055 




mammary epithelial 


hypoxia I8hr 


0.0144 


0.0065 




monocyte 


normoxia 


0.0027 


0.0006 




monocyte 


hypoxia 6hr 


0.0084 


0.0014 


10 


monocyte 


hypoxia I8hr 


0.0080 


0.0016 




neuroblastoma 


normoxia 


0.0344 


0.0011 




neuroblastoma 


hypoxia 6hr 


0.1085 


0.0013 




neuroblastoma 


hypoxia 18hr 


0.0551 


0.0020 




renal epithelial 
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HVJ III v AIM 


0.0275 


0 0046 


15 


renal epithelial 


hypoxia 6hr 


0.0560 


0.0046 




renal epithelial 


hypoxia 1 8 h r 


0.0395 


0.0096 




skeletal myocyte 


normoxia 


0.0088 


0.0029 




skeletal myocyte 


hypoxia 6hr 


0.0277 


0.0035 




skeletal myocyte 


hypoxia 1 8h r 


0.0245 


0.0038 



20 



For instance, in the hypoxic hepatocyte (6hr) the normalised expression values of EGLN and clorfl2 are 
0.015 and 0.0074 respectively, i.e. EGLN being the dominant gene. In contrast, in the neuroblastoma cell 
line SH-SY5Y, the normalised expression values of EGLN and c 1 orf 1 2 after 6hr hypoxia are 0.0012 and 
0.108 respectively, i.e. clorf!2 being the dominant gene by a large margin. This data demonstrates that 
25 clORF12 and EGLN3 are not constitutively expressed at an equal amount in different tissues indicating 
specificity of function. Therefore, it is considered that therapeutic products may be developed based on 
this data, with the goal of modulating proline hydroxylation of target proteins (such as HIFIalpha) in 
specific tissues, based on the differing expression profile of clORF12 and EGLN3 in those tissues. 

In Example lb herein, genes were identified from a custom array, which give a greater induction in 
30 macrophages (by a factor of at least 1.5) when hypoxia is augmented by over-expression of HIFIalpha or 
EPAS from an adenovirus. The data from the HIF/ EPAS over-expression work is presented herein in 
Example lc, but specifically relating to cIORFI2 and EGLN3 is summarised in Figures 32e and 32f. 
From this data it is apparent that EGLN3/ FLJ21620 fis cl.COL07838 but not clORF12 is increased in 
expression by the transcription factor EPAS1 but not HIFIalpha. This is apparent by comparing 
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experimental condition 9 (hypoxia with EPAS overexpression; expression value=3.48) to that of 5 
(hypoxia without EPAS overexpression; expression value= 1.65). This adds valuable information about 
the mechanism of regulation of the gene encoding EGLN3. 

To confirm this data the RNA samples for experimental conditions 1,3*5,7,9 (corresponding to the high 
5 dose of adenovirus) were also measured using a different array-based methodology- the AffyMetrix 
GeneChip. The results of this experiment are presented in Figures 32g and 32h. 

Functional Characterisation ofEGL nine (C.elegans) homolog 3 role in the induction of Cardiomyocyte 
apoptotic cell death 

EGLN3 has been cloned into p0NY8.1 and Smart2.1RES.GFP equine infectious anaemia virus (EIAV) 
10 vectors, and AdCM V.TRACK.GFP (AdenoQuest) adenoviral genome vectors (see co-owned co-pending 
International patent application PCT/GB01/00758). These vectors have been used in "gain-of-function" 
studies in which EGLN3 has been overexpressed in order to elucidate corresponding protein function. 
Human embryo kidney (HEK 293T) and dog osteosarcoma (D17) cell lines have been used in transient 
plasmid transfection experiments to confirm EGLN3 expression from viral vector genomes. Rat 
15 cardiomyocyte cell line (H9C2) and primary human neonatal cardiomyocytes (PHNC) (BioWhittaker, 
CC2582) have been used in viral transduction experiments to determine the biological activity of EGLN3. 
In all cell types, expression of EGLN3 has been followed by combinations of immunofluorescence, 
Western blotting and TaqMan quantitative PCR. Immunofluorescence and Western blotting employ an 
antibody specific for the FLAG epitope engineered into the 3' terminus of EGL nine (C.elegans) homolog 
20 3 (Sigma, F3165). TaqMan quantitative PCR utilises the SYBR Green method (Applied Biosystems). 

Western blotting has confirmed the transient expression of EGLN3 from an EIAV genome construct in 
HEK 293T (expected size approx 717 bp, 26 Kda). Immunofluorescence has localised transient 
expression of EGL nine (C.elegans) homolog 3 from EIAV expression construct in HEK293T to the 
cytoplasm. Expression of EGL nine (C.elegans) homolog 3 is elevated after 4 hours exposure to hypoxic 

25 conditions (0.1% (v/v) oxygen), when compared to expression observed under normoxia (20% (v/v) 
oxygen) (see Figure 32i). TaqMan primers have been designed and optimised for the initial measurement 
of EGL nine (C.elegans) homolog 3 expression in EIAV or Adenovirus transduced H9C2 and PHNC 
(Forward: TCATCGACAGGCTGGTCCTC; Reverse: GTTCCATTTCCCGGATAGAA). All findings at 
the RNA level are corroborated by immunofluorescence and Western blotting analyses at the protein 

30 level. 

EIAV transduction of H9C2 and PHNC has been optimised with constructs containing green fluorescence 
protein (GFP) and LacZ reporter genes, using the VSVg envelope and a range of MOI between 10 and 
100. GFP results were scored by fluorescence microscopy, while LacZ transductants were identified 
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through the assay of p-galactosidase activity. An MOI of 50 transduced approximately 50% of the cell 
population. 

EGLN3 is predicted to have pro-apoptotic activity in cardiomyocytes. Early, Mid and late phase apoptosis 
are characterised by translocation of membrane phospholipid phosphatidylserine (PS) from the inner face 
5 of the plasma membrane to the cell surface, activation of specific proteases (caspases) and fragmentation 
of DNA, respectively (Martin, S.J., et al M J. Exp. Med. 1 995, 182, 1545-1556; Alnemri, E.S., et ah, J. 
Cell. Biochem. 1997, 64, 33-42; Wylie, A.H., et a)., Int. Rev. Cytol. 1980, 68, 251-306). Translocation of 
PS has been identified through use of ApoAlert kit (Clontech; K2025-1), which employs FITC-labelled 
antibodies to detect surface expression of the PS, Annexin V. Caspase activity has been followed using 

10 the homogeneous fluorimetric caspase assay (Roche; 3005372) which allows the quantification of caspase 
activity through the cleavage of a fluorescent substrate. DNA fragmentation has been estimated using the 
nuclear stain Hoescht 33345 (Sigma, B2261; and fluorescence microscopy to locate areas of chromatin 
condensation. Total viability of cell population has been quantified through measurement of the ability of 
mitochondrial reductase to metabolise the fluorescent substrate MTT (Sigma, M2128)(Levitz S.M & 

15 Diamond, R.D. J. Infect. Dis. 1985 Nov; 152(5):938-45). 

Conditions for early, mid and late stage apoptosis in H9C2 and PHNC have been defined using hypoxia 
and nutrient-depleted growth medium to mimic those ischaemic conditions found in vivo (Brar, B.K., et 
al., J. Biol. Chem. 2000, 275, 8508-8514). Transduction of PHNC with EIAV vectors containing EGLN3 
is sufficient to cause an increase in caspase activity in cells cultured under normoxic conditions, 
20 confirming the role of EGLN3 in the induction of cardiomyocyte apoptosis. Using an MOI of 50, a 2-fold 
increase in caspase activity was seen in EGLN3 transduced cells, when compared to controls 48 hours 
post transduction (see Figure 32j). 

Increased expression of EGL nine (C.elegans) homolog 3 in transduced cells is confirmed by TaqMan, 
immunofluorescence and Western blotting. Similar experiments are performed to determine whether EGL 
25 nine (C.elegans) homolog 3 expression further sensitises H9C2 and PHNC to previously defined 
ischaemic insults. Staurosporine (Calbiochem; 569397) and Smart2.IRES.GFP EIAV vectors containing 
the Bax gene will be applied as chemical and viral pro-apoptotic controls, respectively (Yue, T-L., et al., 
J. Mol. Cell. Cardiol. 1998, 30, 495-507; Reed, J.C. J Cell Biol. 1994, 124(l-2):l-6). 

30 Gene silencing approaches may be undertaken to down-regulate endogenous expression of EGLN3 in 
PHNC to determine the degree of protection against apoptotic cell death provided by a reduction in 
EGLN3 activity. RNA interference (RNAi) (Elbashir, SM et al., Nature 2001, 411, 494-498) is one 
method of sequence specific post-transcriptional gene silencing that may be employed. Short dsRNA 
oligonucleotides are synthesised in vitro and introduced into a cell. The sequence specific binding of 
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these dsRNA oligonucleotides triggers the degradation of target mRNA, reducing or ablating target 
protein expression. A Hammerhead ribozyme library, contained in E1AV expression vectors, may also be 
applied. Efficacy of both gene silencing approaches may be assessed initially through the measurement of 
EGLN3 expression, at the RNA level by TaqMan and at the protein level by Western blotting. Protection 
5 against previously described ischaemic insults provided by these methods of EGLN3 gene silencing may 
be assayed biologically as detailed above. Caspase inhibitors (caspase 3 inhibitor V, 2129002 and caspase 
inhibitor I, 627610, both Calbiochem) and Smart2.IRES.GFP EIAV vectors containing the BcI-2 gene 
may be applied as chemical and viral anti-apoptotic controls, respectively (Kroemer, G. Nat Med. 1997, 
3(6):614-20). 

10 Similar "gain-of-function" and gene silencing approaches will be applied to the related gene, encoded by 
SEQ ID 90, named dofl2. 

Genes with a greater response in renal epithelial cells 

The dataset of Table 12 also contains genes which are induced preferentially in renal epithelial cells, in 
response to hypoxia. These genes are presented in Figures 39-44. 

15 SeqID:l 17/1 18 EST 

SeqID:129/130 Hypothetical protein FU22622 

SeqlD:31/32 TRIP-Br2 

SeqID:30I/302 Tumor protein D52 

SeqID:91/92/92aSemaphorin4b 
20 SeqID:371/372 Dec-i 

For Semaphorin 4b (SeqID:91/92/92a), the clone presented in Figure 43 is p 1 P 1 4 , corresponding to 
IMAGE clone acc BE910319, the sequence of which covers a large region of the gene including protein 
coding sequence, which may cross-hybridise to other members of the semaphorin family. A separate 
clone (pIDI7) as found in the original filing, was derived from the subtracted library and corresponds to a 
25 more unique region of this gene in the untranslated region. From Table 12 it will be appreciated that a 
significant response is also found in the macrophage. This is validated by RNase protection assay data 
(see Figure 57). Further clarification of this gene using complementary experimentation methods will 
resolve the exact cell-type specific nature of the expression of this gene, though it is clear from this data 
that it is induced in renal epithelial cells and macrophages. 

30 Genes with a greater response in mammary epithelial cells 

The dataset of Table 12 also contains genes which are induced preferentially in mammary epithelial cells, 
in response to hypoxia. These genes are presented in Figures 45-52. 
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SeqlD:447/448 Calgranulin A 
SeqID:67/68 EROi (S. cerevisiae)-Iike 

SeqID:25/26 Hypothetical protein FLJ20500 

SeqID:229/230 N-rayc downstream regulated 
5 SeqID:387/388 Decidual protein induced by progesterone 
SeqID:379/380 Integrin, alpha 5 
SeqID:225/226 Tissue factor 
SeqID:237/238 COX-2 

In the case of Cox-2, which encodes a key drug target, it can be seen that in many cell types, especially 
10 the mammary epithelial cells, there is a clear induction in response to hypoxia. In contrast, for endothelial 
cells there is a very significant time-dependent decrease in Cox-2 gene expression in response to 
hypoxia. Similarly, for Calgranulin A, there is strong positive induction in hypoxic mammary epithelial 
cells, but in the macrophage, the response to hypoxia is negative. These clearly exemplify the unexpected 
■ finding that cell types respond to hypoxia differentially, both quantitatively but also qualitatively. This is 
15 not currently known. 

Hypoxia regulation of Novel human genes 

From Table 12, it will be appreciated that several genes with no prior annotation in public domain gene 
sequence databases are now identified as being regulated by hypoxia, in at least one cell type. To make 
this clear, these genes have been copied from Table 12 and presented in Tables 13 and 14), showing the 

20 hypoxia/ normoxia induction ratio of the cell type in which the response is most pronounced. These 
figures are derived by dividing the normalised expression value, as found in Table 5, in hypoxia by that in 
normoxia for the same cell type. In some cases, where hypoxia causes inhibition of gene expression, the 
fold change is prefixed by the term "DOWN". The cell type and time point of maximal response to 
hypoxia are also noted in Tables 13 and 14. The main purpose of Tables 13 and 14 is to demonstrate that 

25 these genes have significant responses to hypoxia per se. 

In many cases, significant responses are seen in multiple cell types, though this data is not apparent here. 
In Table 13, the cDNA clones are currently un-annotated in public domain databases. In Table 14, the 
cDN A clones are currently annotated, but were not so as at the priority date. 

Example 4: Additional disclosure of the effect of macrophage activation on hypoxia regulation of 
30 gene expression 

In Example 2, it is shown that activated and resting macrophages respond to hypoxia in different ways, 
showing that the hypoxia response is not a generic phenomenon. To consolidate this data, experiments 
were performed with the custom array, using additional experimental conditions and with a more in-depth 
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analysis. Significantly, the expression values used are not simple hypoxia/ normoxia ratios, done 
separately for macrophages of differing activation status, but rather the values used allow comparison of 
the relative expression levels throughout the entire set of experimental conditions. Hence, for any gene, 
all values throughout the entire set of experimental conditions are calculated by comparison to (he median 

5 level of that gene throughout the dataset. This allows a clearer appreciation of the effects of hypoxia in 
the context of cell activation status. The following data demonstrates that of the newly discovered genes 
responsive to hypoxia, expression changes are also seen in response to key cytokines of the immune 
system, implying functions outside of the generic response to hypoxia and metabolism. This especially 
applies to unannotated genes, including ESTs and hypothetical proteins, showing potential functions in 

10 inflammation and angiogenesis on the basis of cytokine-regulation. 

Macrophages were derived and cultured as described elsewhere in the specification. A total of 6 
experimental conditions were analysed, as shown below. Where cells were treated with cytokines or 
hypoxia (0.1% oxygen), this was for 6 hr. Lipopolysaccharide (LPS) (from E.coli 026:B6; Sigma), 
gamma Interferon (1FN) and Interleukin-10 (1L-10) were all used at a final concentration of lOOng/ml. 
15 The effect of gamma Interferon and Lipopolysaccharide is to activate macrophages, with a Thl biased 
phenotype, as found in many inflammatory conditions. Interleukin-10 is a Th2 cytokine and de-activates 
macrophages, and suppresses their effector functions. 

Experimental 
Condition . 



20 I. No cytokines Normoxia 

2. No cytokines Hypoxia 

3. IL-10 Normoxia 

4. IL-10 Hypoxia 

5. LPS+IFN Normoxia 
25 6. LPS+IFN Hypoxia 



In Table 15, genes are shown which respond to LPS+IFN in normoxia by producing at least a 2-fold 
increase in expression, indicating probable pro-inflammatory functions. From this dataset various patterns 
of hypoxia regulation will be appreciated on top of the effect of LPS+IFN. 

For instance, the gene SCYA8 (p 1 12 1 ; SeqID: 479/480) is decreased in expression by hypoxia, changing 
30 from 0.S4 to 0.18 between conditions #1 and 82. In condition #5 (LPS+IFN normoxia), expression is 
dramatically increased to a value of 19.6. When LPS+IFN is combined with hypoxia, this increase is 
dampened-down to a value of 12.2. So for this example, hypoxia and cell activation have opposing effects 
on gene expression. A similar expression profile is found for several other genes in Table 15. 
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In contrast, the gene P8 protein-candidate of metastasis 1 <ptF17; SeqID: 329/330) is increased in 
expression by hypoxia, changing from 0.26 to 1.78 between conditions #1 and #2. In condition #5 
(LPS+IFN normoxia) expression is increased from condition #1 to a value of 1.16. In condition #6, 
(LPS+IFN normoxia) the expression is further increased to a value of 2.59. So for this example, hypoxia 
5 and cell activation have similar effects on expression (i.e. increases) and these are found to be synergistic. 
A similar expression profile is found for several other genes in Table 15, including for Semaphorin 4b 
(plP14; SeqID :91/92/92a), which has been independently verified by RNase protection assay (see Figure 
57). . 

A selection of novel genes taken from Table 15 is also presented as Figure 53. These novel genes are 
10 hence annotated here for the first time as being regulated not only by hypoxia, but also by Thl 
inflammatory signals, as provided by LPS+IFN. 

It will be appreciated that certain IMAGE clones were classed as novel and unannotated when the original 
patent filing was made (8 Dec 2000), but which can now be assigned to named genes. These are Uridine 
5* monophosphate hydrolase 1 (clone plI7; SeqID: 49/50) and Insulin induced protein 2 (clone pIDIO; 
15 SeqID:75/76). 

In Table 16, genes are shown which respond to LPS+IFN in normoxia by producing at least a 2-fold 
decrease in expression. From this dataset, various patterns of hypoxia regulation will be appreciated on 
top of the effect of LPS+IFN. 

In Figure 54, novel genes from Table 16 which are down-regulated by LPS+IFN and up-regulated by 
20 hypoxia are presented. For most of these, the combined effect of LPS+IFN AND hypoxia produces only a 
minor induction above the level of expression for activated normoxic cells (for example plF8/ SeqID:10/ 
Hypothetical Protein KIAA0914). In other cases, this is not the case, and hypoxia is able to over-ride the 
inhibitory effect of LPS+IFN on gene expression (for example plD12/ SeqID:30/ Hypothetical Protein 
KIAA1376). This clearly demonstrates the finding that different cell types or physiological states of a cell 
25 " type (as here), respond to hypoxia differently. 

In Figure 55, novel genes from Table 16 which are down-regulated both by LPS+IFN and by hypoxia are 
presented. In many of the genes presented here, these stimuli are synergistic, with minimal expression 
obtained with a combination of LPS+IFN and hypoxia. 

In Figure 56, a selection of named genes from Table 16 which are down-regulated by LPS+IFN, with 
30 various responses to hypoxia are presented. For the gene, Max-interacting Protein 1 two separate clones 
were available on the array corresponding to this gene (plG5 from SeqID:280 and plD22 from 
SeqlD:120). In the original specification, the IMAGE clone corresponding to SeqID:I20 (accession 
AA401496) was classified as an EST, and the IMAGE clone corresponding to SeqID:280 (accession 
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AA40I496) was classified as "Max-interacting Protein 1", as determined by the UniGene database at that 
time. Now it is apparent that both of these clones correspond to Max-interacting Protein I, explaining the 
similarity of their expression profiles in Figure 56. Clearly the response of this gene to hypoxia is 
inhibited by LPS+IFN. 

5 The additional data showing effects of the Thl activation stimulus LPS+IFN extends the finding of these 
genes as novel hypoxia regulated genes, and provides additional information about the relevance of these 
genes to disease mechanisms. 

It will be appreciated that certain IMAGE clones were classed as novel and unannotated when the original 
patent filing was made (8 Dec 2000), but which can now be assigned to named genes. These are TRIP- 
10 Br2 (clone plD15; SeqID:31/32), MAX -interacting protein I (clone pID22; SeqID:l 19/120). 

In Tables 15 and 16 and Figures 53-56, showing genes which respond to LPS+IFN, it will be noticed that 
some of these genes also response to the inhibitory cytokine IL-10 (e.g. Semaphorin 4b, Hypothetical 
protein CGI-1I7). Other genes respond only to IL-10, but not to LPS+IFN. Specific responses to IL-10 
are significant because this cytokine has been shown to have utility in suppressing inflammatory reactions 
15 (Huizinga TW et aL, Rheumatology 2000, 39: 1 180-8). 

Table 17 shows genes responsive to IL-10 (increased or decreased) but not affected significantly by 
LPS+IFN. Various patterns of hypoxia regulation will be appreciated. 

Example 5: Gene expression in human tumors 

One of the utilities of the genes identified herein relates to the diagnosis and treatment of human tumors, 
20 on the basis that hypoxia is frequently found in tumors. 

A study has been performed to examine the expression of these genes in a selection of breast and ovary 
tumors, comparing expression with normal adjacent tissue from the same patient, There is expected to be 
a large degree of variation between different patients, and the study here contains only 5 patients with a 
range of diagnoses. Therefore although certain genes will be identified from this data, other genes in the 
25 current specification not flagged by this study are nevertheless likely to have utility in cancer. 

Patients are designated as Letters: 

E: 50 year old Caucasian female. Diagnosis: ovarian adenocarcinoma. Normal ovarian tissue derived 
from an age-matched separate individual. 

F: 60 year old female. Diagnosis: poorly differentiated adenocarcinoma. Normal ovarian tissue derived 
30 from the same individual. 
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G: 41 year old female. Diagnosis: moderately-differentiated adenocarcinoma. Normal ovarian tissue 
derived from the same individual. 

H: 40 year old female. Diagnosis: invasive ductal carcinoma. Normal breast tissue derived from the same 
individual. 

5 K: 58 year old female. Diagnosis: invasive ductal carcinoma. Normal breast tissue derived from the same 
individual. 

Data normalisation was done per-chip to correct for differences in labelling and hybridisation efficiency. 
Per-gene normalisation was done such that the expression values of each gene are relative to the median 
value of that gene throughout the series of samples. By comparing the expression values under normal 
10 (nor) and tumor (turn) for a single patient, differences in expression between the normal and malignant 
tissue of that patient can be inferred. 

In Table 18 are genes which are up-regulated at least 3-fold in at least one patient, comparing the tumor 
tissue to the adjacent normal tissue. 

In Table 19 are genes which are down-regulated at least 3-fold in at least one patient, comparing the 
15 tumor tissue to the adjacent normal tissue. 

Example 6: Effects of inflammatory cytokines on hypoxia-regulated genes 

Tumor necrosis factor alpha (TNFoc) is a key pro-inflammatory cytokine both produced by and acting on 
the macrophage. The significance of TNFot to human disease is well established in the art. 
This is particularly the case in Rheumatoid arthritis and neutralising antibodies to TNFahave been 

20 reported to offer clinical utility. Because hypoxia is another pathological condition 
exerted on macrophages in the synovia of RA patients, synergistic effects of these 
two stimuli are highly relevant to the discovery of novel inflammatory targets 
expressed by the macrophage. To investigate this, primary human macrophages 
we re exposed to either hypoxia (0.1% oxygen) or 100 ng/ml TNFaor to both for 6hr. The 

25 data shown below provides further credence to the utility of the encoded proteins as inflammatory targets 
in macrophages and applies to any disease where hypoxia and TNFaare co-in cidenL 

Gene expression levels were measured and compared using the custom gene array. In data analysis per- 
* gene normalisation was set up such that expression values represent the fold-change compared with the 
expression in untreated normoxic cells. Genes which are increased in expression in response to TNFaby 
30 at least 2-fold, in either normoxic or hypoxic cells, are shown in Table 20. Genes which are decreased in 
expression in response to TNFaby at least 2-fold, in either normoxic or hypoxic cells, are shown in Table 
21. 
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Another inflammatory cytokine implicated in diseases where hypoxia is frequently 
found is Interleukin-1 7 (IL-17). For example, this cytokine has been shown to 
. mediate inflammation and joint destruction in arthritis (Lubberts et al J.Immunol 
200 1 1 67:1 004-1 01 3). IL-17 has also been shown to stimulate macrophages to 
5 release other key pro-inflammatory cytokines (Jovanovic et a I J Immunol 1998 
1 60:35 1 3-2 1). Therefore genes which respond to both hypoxia and IL-17 are 
especially likely to be relevant to disease processes and have utility in the design of 
therapeutic products. Genes which are increased in expression in response to IL-17 by at least 2- 
fold, in either normoxic or hypoxic cells, are shown in Table 22. Genes which are decreased in expression 
10 in response to IL-17 by at least 2-fold, in either normoxic or hypoxic cells, are shown in Table 23. 

The cytokine IL-15 is implicated in several disease in which macrophages and hypoxia both feature as 
elements of the inflammatory state, such as in atherosclerosis (Wuttge DM et al Am J Pathol 2001 
159:417-23) and rheumatoid arthritis (Mclnnes IB et al Immunol Today. 1998 19:75-9). Although the 
main target of IL-15 is T-cells effects have also been shown on monocytes (Badolato R et al Blood. 1997 
15 90:2804-9). Therefore genes which respond to both hypoxia and IL-15 are especially 
likely to be relevant to disease processes and have utility in the design of 
therapeutic products. Genes which are increased in expression in response to IL-15 by at least 2- 
fold, in either normoxic or hypoxic cells, are shown in Table 24. Genes which are decreased in expression 
in response to IL-15 by at least 2-fold, in either normoxic or hypoxic cells, are shown in Table 25. 

20 Example 7: Rat foetal cardiomyocytes 

Primary rat foetal cardiomyocytes provide an attractive experimental model for studying the responses of 
cardiac cells to ischaemia. Cells are obtained which are non-immortalised and which are seen to contract 
or beat in culture. It is of interest to examine how the responses of these cells to hypoxia (or related 
experimental conditions) compared and contrasts to other cell types. These other cell types might include 

25 those that are similarly sensitive to the effects of hypoxia (such as neurones) or might be cells that show a 
higher tolerance to hypoxia (such as macrophages). Experiments are performed in parallel for 
cardiomyocytes and other cell type(s). The responses of these specific cell types is then determined by 
hybridising labelled mRNA to microarrays. Alternative methods will include the construction of 
subtracted cDNA libraries for the individual treated cell types and assessing which genes are contained 

30 therein by sequencing. 

Methods 

Cardiomyocytes are harvested from heart ventricles of embryos aged E18 days, using a cell isolation kit 
(Neonatal cardiomyocyte isolation system; Worthington Biochemical Corporation, Lakewood, New 
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Jersey, 08701). They are seeded at 5xl0 6 cells/lOOcm diameter petri dish in DMEM/M 199, 10% horse 
serum, 5% FCS, 1% penicillin, streptomycin, glutamine for 5 days at 37C. Media is changed during the 5 
days. 

Other cell types used for comparison with cardiomyocytes, are cultured according to their optimum 
5 conditions and/ or the standard routine. These cell types may include cardiomyocytes in a different 
physiological setting, such as in an intact beating heart, or a different developmental state of the 
cardiomyocyte, such as cardiomyoblast. 

Identical seeded petri dishes are placed either in a standard tissue culture incubator (95% air/ 5% C02) or 
in a hypoxia incubator (0.1% oxygen / 5% C02 / 0.1% oxygen for 6 hours. This is done separately for 
10 both cardiomyocytes and the other cell type(s) to be compared. Other experimental conditions might more 
closely approximate ishemia, by incorporating components additional to hypoxia. 

At the end of the exposure to hypoxia, cells are placed on a chilled platform, washed in cold PBS and 
total RNA is extracted using RNazol B (Tel-Test, Inc; distributed by Biogenesis Ltd) following the 
manufacturers instructions. Where appropriate, polyadenylated mRNA is extracted from the total RNA 
15 using a commercial kit following the manufacturers instructions (Promega; PolyATract mRNA isolation 
System IV). 

Array hybridisations and construction/analysis of subtracted cDNA libraries are performed according to 
standard methods or as described elsewhere in this specification. 

Example 8: Comparison of the hypoxic-responses between populations of rat primary cultured 
20 neurons by a subtraction cloning / array screening approach. 

Different regions of the central nervous system display different sensitivities to hypoxia and to ischaemia. 
Susceptibility to tissue- damage in this manner may occur as a result of intrinsic differences in gene 
expression between cells. To evaluate this hypothesis, primary cultures of rat neurons from different 
regions of the brain are established. Cultures are exposed to various experimental conditions which are 

25 pertinent to pathologies of the hypoxic/ischemic brain. These would include hypoxic insults as have been 
described, or to hypoxia/ischaemia where the conditions more closely approximate pathological ischemia. 
Either condition may be preceded by prior hypoxic-preconditioning, where transient exposure to hypoxia 
renders cells less sensitive to subsequent acute treatment. For all possible experimental treatments, a 
similar routine is performed for distinct neuron subtypes, in order to compare their responses. Such 

30 comparisons may be made by hybridizing labelled mRNA to microarrays or derivatives thereof. 
Alternatively subtracted libraries might be constructed individually for each treated neuron subtype, and 
clones which are confirmed to be changed in expression to be sequenced. The collection of genes arising 
from the different neuron subtypes will be compared. 
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Methods 

Primary cultures are established according to standard procedures from embryonic rats aged from E14 to 
E18 (Dunnett SB, Bjorkland A (Eds.) 1992. Neural Transplantation, A Practical Approach. IRL Press). 
Isolated neurons include but are not limited to those from ventral mesencephalon, striatum, hippocampus, 
5 cerebellum, cerebral cortex, dorsal root ganglia and superior cervical ganglia. 

Cells are maintained in culture for 3-14 days in humidified culture incubators at 37°C, 5% C02, 95% air 
(Normoxia) in Neurobasal Medium (Brewer GJ, 1995, Journal of Neuroscience Research 42:674-83) 
supplemented with B27 (both Life Technologies). For the hypoxia-preconditioning, cells are transferred 
to a second incubator at 37°C, 5% C02, 94.9% Nitrogen, 0.1% Oxygen (Hypoxia) for 30-180 minutes 

10 and returned to the normoxic incubator for 24 hours (Pringle et a/., 1997, Neuropathology and Applied 
Neurobiology 23:289-298). For the hypoxic stimulus, either independent from or subsequent to hypoxia- 
preconditioning, cells are transferred to the hypoxic incubator for 2-6 hours as determined in time course 
experiments. Additionally, as appropriate, the medium in which the cells are grown is replaced with 
glucose-free media for establishment of experimental ischaemia (Ray AM, Owen DE, Evans ML, Davis 

15 JB Benham, 2000. Caspase inhibitors are functionally neuroprotective against oxygen glucose 
deprivation induced CA1 death in rat organotypic hippocampal slices). At the end of the exposure to 
hypoxia (or hypoxia/ischaemia), ceils are, placed on a chilled platform, washed in cold PBS and total 
RNA is extracted using RNazol B (Tel-Test, Inc; distributed by Biogenesis Ltd) following the 
manufacturers instructions. Where appropriate, polyadenylated mRNA is extracted from the total RNA 

20 using a commercial kit following the manufacturers instructions (Promega; PolyATract mRNA isolation 
System IV). 

Array hybridisations and construction/analysis of subtracted cDNA libraries are performed according to 
standard methods or as described elsewhere in this specification. 

Example 9: Semaphorin 4b 

25 We have screened cDNA libraries derived from the human brain and leukocytes, to obtain an unequivocal 
and accurate full length cDNA sequence (SEQ ID No 92a) and the accurate presumptive amino acid 
sequence (SEQ ID No 91). 

The amino acid sequence above was derived by taking the first ATG. We have various independent lines 
of evidence that this is the bona fide translation initiation codon. 

30 Basic analysis of this sequence, reveals the following motifs: 
signal peptide (pSORT) Start: 1 End: 37; 
Transmembrane (pSORT) Start: 718 End: 734; 
cleavage site (pSORT) Start: 38 End: 38; 
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Proline rich region Start: 758 End: 824; 
Sema domain (pfam) Start: 70 End: 503; 
Plexin repeal (pfam) Start: 525 End: 548; 
integrin, beta domain (pfam) Start: 532 End: 546; 
5 cytoplasmic tail Start: 735 End: 837. 

To confirm the hypoxic regulation of Sema4b, we used RNase protection assay (see Figure 57). Hypoxia 
is a feature of several inflammatory conditions often accompanied by superoxide radicals and the immune 
regulator gamma interferon. In this experiment we have made the following findings: 

• Expression is activated by hypoxia (3.3 fold) 

10 • Expression is activated by gamma interferon and LPS.(3.9 fold) 

• Expression is activated synergistically by hypoxia plus gamma interferon/ LPS (7.3 fold) 

• Expression is activated by superoxide radicals (5.0 fold) 

To investigate the size of the mRNA and the tissue distribution, Northern blotting was done (see Figure 
58). This shows that the gene is expressed as a single transcript at relatively low levels in unstimulated 
15 human tissues. 

We have also found that a molecule that is probably associated with Semaphorin 4B, called psd-95'is 
another macrophage hypoxia-induced protein (see SEQ ID No 299). This is based on the fad that psd-95 
binds the cytoplasmic tail of Sema4c (Inagaki et aI M J Biol Chem. 2001; 276(12): 9174-81), which like 
Sema4b, contains proline rich sequence. Therefore, both Semaphorin 4B, and a probable partner are co- 
20 ordinately regulated by hypoxia. 

Example 10: Discussion of relevance of individual clones 

The Oxford BioMedica clone plF12 represents Hypothetical protein FLJ13611. The protein sequence 
encoded by Hypothetical protein FL J 13611 is represented in the public databases by the accession 
NP_079217 and is described in this patent by Seq ID I. The nucleotide sequence is represented in the 
25 public sequence databases by the accession NM_02494i and is described in this patent by Seq ID 2. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. 

The Oxford BioMedica clone plF2 represents Hypothetical protein FLJ20037. The protein sequence 
30 encoded by Hypothetical protein FLJ20037 is represented in the public databases by the accession 
CAB6598I and is described in this patent by Seq ID 3. The nucleotide sequence is represented in the 
public sequence databases by the accession NM _01 7633 and is described in this patent by Seq ID 4. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
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implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. Hypoxia is frequently found in human tumours where macrophage infiltrates are also found. In 
a series of 5 patients with either ovarian or breast cancer, Hypothetical protein FLJ20037 is down- 
regulated in the malignant tissue as compared to adjacent normal tissue in at least one patient. We expect 
5 increased activity of the gene product to have an anti-tumour effect. 

The Oxford BioMedica clone plFlO represents Hypothetical protein DKFZp434P0116. The protein 
sequence encoded by Hypothetical protein DKFZp434P0116 is represented in the public databases by the 
accession T46364 and is described in this patent by Seq ID 5. The nucleotide sequence is represented in 
the public sequence databases by the accession NM_017593 and is described in this patent by Seq ID 6. 
10 Hypothetical protein DKFZp434P0! 16 is predicted to be a kinase due to high structural similarity with 
other known kinases. Hypoxia is an important feature of several diseases, and genes that respond to this 
stimulus are therefore implicated in the pathogenesis and have utility in the design of therapeutic, 
prognostic and diagnostic products. Hypothetical protein DKFZp434P01 16 is repressed in macrophages 
activated by LPS and gamma interferon. We expect it to have an anti-inflammatory role. 

15 The Oxford BioMedica clone p 1 F 1 9 represents Hypothetical protein KIAA0212. The protein sequence 
encoded by Hypothetical protein K1AA0212 is represented in the public databases by the accession 
BAA 13203 and is described in this patent by Seq ID 7. The nucleotide sequence is represented in the 
public sequence databases by the accession NM_014674 and is described in this patent by Seq ID 8. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 

20. implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. 

The Oxford BioMedica clone plF8 represents Hypothetical protein KIAA0914. The protein sequence 
encoded by Hypothetical protein KIAA0914 is represented in the public databases by the accession 
NP_055698 and is described in this patent by Seq ID 9. The nucleotide sequence is represented in the 

25 public sequence databases by the accession NM_014883 and is described in this patent by Seq ID 10. 
Hypothetical protein KIAA0914 shows high structural similarity to Human Class I alpha 1,2- 
Mannosidase and conservation of active site and binding site residues, therefore we predict that 
Hypothetical protein K1AA09I4 will act as a mannosidase. Hypoxia is an important feature of several 
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 

30 utility in the design of therapeutic, prognostic and diagnostic products. Macrophages are key to several 
diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are 
frequently activated by cytokines, which have been shown to be present at disease sites, so gene 
expression responses to both hypoxia and cytokines are especially relevant. Hypothetical protein 
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KIAA09I4 is repressed in macrophages activated by LPS and gamma interferon. We expect the gene 
product to have an anti-inflammatory role. 

The Oxford BioMedica clone p I F5 represents Hypothetical protein FLJ20281. The protein sequence 
encoded by Hypothetical protein FLJ20281 is represented in the public databases by the accession 
5 XP_008736 and is described in this patent by Seq ID 11. The nucleotide sequence is represented in the 
public sequence databases by the accession NMJH7742 and is described in this patent by Seq ID 12. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. TNFalpha is an inflammatory cytokine, which acts on macrophages, and has been shown to be 
10 central to the pathophysiology and treatment of diseases including rheumatoid arthritis. Genes that change 
in expression in response to TNFalpha therefore have utility in the design of therapeutic, prognostic and 
diagnostic products for such inflammatory conditions. Hypothetical protein FLJ20281 is induced in 
.macrophages activated by TNFalpha. 

The Oxford BioMedica clone plF18 represents Hypothetical protein KIAA0876. The protein sequence 
15 encoded by Hypothetical protein KIAA0876 is represented in the public databases by the accession 
BAA74899 and is described in this patent by Seq ID 13. The nucleotide sequence is represented in the 
public sequence databases by the accession XMJ)35625 and is described in this patent by Seq ID 14. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
20 products. 

The Oxford BioMedica clone plF7 represents Spectrin, beta, non-erythrocytic 1. The protein sequence 
encoded by Spectrin, beta, non-erythrocytic 1 is represented in the public databases by the accession 
NP_003119 and is described in this patent by Seq ID 15. The nucleotide sequence is represented in the 
public sequence databases by the accession NM_003128 and is described in this patent by Seq ID 16. 

25 Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. Hypoxia is frequently found in human tumours where macrophage infiltrates are also found. In 
a series of 5 patients with either ovarian or breast cancer, Spectrin, beta, non-erythrocytic 1 is down- 
regulated in the malignant tissue as compared to adjacent normal tissue in at least one patient. We expect 

30 increased activity of the gene product to have an anti-tumour effect. 

The Oxford BioMedica clone plF21 represents Hematopoietic Zinc finger protein. The protein sequence 
encoded by Hematopoietic Zinc finger protein is represented in the public databases by the accession 
AAL08625 and is described in this patent by Seq ID 17. The nucleotide sequence is represented in the 
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public sequence databases by the accession AK024404 and is described in this patent by Seq ID 18. 
Hematopoietic Zinc finger protein is a transcriptional regulator that contains a Cys2-His2 zinc finger 
motif. It is predicted to bind to metal response elements (MRE) and therefore activate the transcription of 
genes that contain a MRE sequence within their promoter region such as metallothioneins. Hypoxia is an 
5 important feature of several diseases, and genes that respond to this stimulus-are therefore implicated in 
the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products. There is 
a prejudice in the art that the response to hypoxia is generic to all cell types. Contrary to this, we show 
that genes are regulated by hypoxia to a greater degree in certain cell types, substantiating their utility in 
designing specific therapeutic products for diseases involving those cell types. Monocytes and 
10 macrophages have been implicated in the following diseases involving hypoxia: rheumatoid arthritis, 
atherosclerosis, cancer, COPD. Hematopoietic Zinc finger protein is preferentially induced by hypoxia in 
monocytes or macrophages and a restricted number of other cell types. It is therefore a candidate for 
specific intervention for treatment or diagnosis of the above diseases. 

The Oxford BioMedica clone plF9 represents Hypothetical protein KIAA0742. The protein sequence 
15 encoded by Hypothetical protein KIAA0742 is represented in the public databases by the accession 
NP_060903 and is described in this patent by Seq ID 19. The nucleotide sequence is represented in the 
public sequence databases by the accession AB01 8285 and is described in this patent by Seq' ID 20. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
20 products. Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory 
processes. In these, macrophages are frequently activated by cytokines, which have been shown to be 
present at disease sites, so gene expression responses to both hypoxia and cytokines are especially 
relevant. Hypothetical protein KIAA0742 is repressed in macrophages activated by LPS and gamma 
interferon. We expect it to have an anti-inflammatory role. Hypothetical protein KIAA0742 shows 
25 significant homolgy to the transcription factor hairless. We therefore propose that Hypothetical protein 
K1AA0742 may play a crucial role in the regulation of hair growth. Accordingly, this aspect of the 
invention includes the use of this protein, fragments and functional equivalents of this protein, encoding 
nucleic acid molecules, in addition to ligands that bind specifically to this protein, in the diagnosis and 
treatment of hair loss. 

30 The Oxford BioMedica clone pi E 1 3 represents Hypothetical protein PRO0823. The protein sequence 
encoded by Hypothetical protein PRO0823 is represented in the public databases by the accession 
AAF71073 and is described in (his patent by Seq ID 21. The nucleotide sequence is represented in the 
public sequence databases by the accession AF1 16653 and is described in this patent by Seq ID 22. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
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implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory 
processes. In these, macrophages are frequently activated by cytokines, which have been shown to be 
present at disease sites, so gene expression responses to both hypoxia and cytokines are especially 
5 relevant. Hypothetical protein PRO0823 is repressed in macrophages activated by LPS and gamma 
interferon. We expect it to have an anti-inflammatory role. Hypoxia is frequently found in human tumours 
where macrophage infiltrates are also found. In a series of 5 patients with either ovarian or breast cancer, 
Hypothetical protein PRO0823 is down-regulated in the malignant tissue as compared to adjacent normal 
tissue in at least one patient. We expect increased activity of the gene product to have an anti-tumour 
10 effect. 

The Oxford BioMedica clones plDI and plD2 represent the Hypothetical protein FLJI0134. The protein 
sequence encoded by Hypothetical protein FLJ10134 is represented in the public databases by the 
accession NP_060474 and is described in this patent by Seq ID 23. The nucleotide sequence is 
represented in the public sequence databases by the accession NM_018004 and is described in this patent 

15 by Seq ID 24. Hypoxia is an important feature of several diseases, and genes that respond to this stimulus 
are therefore implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and 
diagnostic products. HIFlalpha and EPASI are transcription factors that mediate the response to hypoxia 
of several genes, and have them selves been implicated in specific diseases. By adenoviral over- 
expression of EPASI we show augmentation of the hypoxic induction of certain genes, further 

20 confirming their status as responsive to hypoxia. Hypothetical protein FLJ10134 has been shown to be 
induced by hypoxia to a greater degree following adenoviral over-expression of EPASI. Macrophages are 
key to several diseases involving hypoxia, and contribute to inflammatory processes. In these, 
macrophages are frequently activated by cytokines, which have been shown to be present at disease sites, 
so gene expression responses to both hypoxia and cytokines are especially relevant. Hypothetical protein 

25 FLJI0134 is repressed in macrophages activated by LPS and gamma interferon. We expect it to have an 
anti-inflammatory role. Hypoxia is frequently found in human tumours where macrophage infiltrates are 
also found. In a series of 5 patients with either ovarian or breast cancer, Hypothetical protein FLJ10134 is 
down-regulated in the malignant tissue as compared to adjacent normal tissue in at least one patient. We 
expect increased activity of the gene product to have an anti-tumour effect. 

30 The Oxford BioMedica clone pi D4 represents Hypothetical protein FLJ20500. The protein sequence 
encoded by Hypothetical protein FLJ20500 is represented in the public databases by the accession 
NPJ)6I931 and is described in this patent by Seq ID 25. The nucleotide sequence is represented in the 
public sequence databases by the accession NM_019058 and is described in this patent by Seq ID 26. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
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implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. Hypothetical protein FLJ20500 is preferentially induced by hypoxia in mammary epithelial 
cells. 

The Oxford BioMedica clone plD9 represents Hypothetical protein DKFZP564D1 16. The protein 
5 sequence encoded by Hypothetical protein DKFZP564D1 16 is represented in the public databases by the 
accession T08708 and is described in this patent by Seq ID 27. The nucleotide sequence is represented in 
the public sequence databases by the accession AL050022 and is described in this patent by Seq ID 28. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 

10 products. Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory 
processes. In these, macrophages are frequently activated by cytokines, which have been shown to be 
present at disease sites, so gene expression responses to both hypoxia and cytokines are especially 
relevant. Hypothetical protein DKFZP564D116 is repressed in macrophages activated by LPS and 
gamma interferon. TNFalpha is an inflammatory cytokine, which acts on macrophages, and has been 

15 shown to be central to the pathophysiology and treatment of diseases including rheumatoid arthritis. 
Genes that change in expression in response to TNFalpha therefore have utility in the design of 
therapeutic, prognostic and diagnostic products for such inflammatory conditions. Hypothetical protein 
DKFZP564D1 16 is induced in macrophages activated by TNFalpha. 

The Oxford BioMedica clone plD12 represents Hypothetical protein KIAA1376. The protein sequence 
20 encoded by Hypothetical protein KIAA1376 is represented in the public databases by the accession 
BAA92614 and is described in this patent by Seq ID 29. The nucleotide sequence is represented in the 
public sequence databases by the accession AB037797 and is described in this patent by Seq ID 30. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
25 products. Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory 
processes. In these, macrophages are frequently activated by cytokines, which have been shown to be 
present at disease sites, so gene expression responses to both hypoxia and cytokines are especially 
relevant. Hypothetical protein KIAA1376 is repressed in macrophages activated by LPS and gamma 
interferon. We expect it to have an anti-inflammatory role. 

30 The Oxford BioMedica clone pi D 1 5 represents TRIP-Br2. The protein sequence encoded by TRIP-Br2 is 
represented in the public databases by the accession NPJ55570 and is described in this patent by Seq ID 
31. The nucleotide sequence is represented in the public sequence databases by the accession 
NMJH4755 and is described in this patent by Seq ID 32.. TRIP-BR2 is a PHD zinc finger and 
bromodomain interacting protein transcriptional regulator and is involved in the regulation of cell cycle 
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progression. Its hypoxia-regulation is likely to have important disease-relevant effects. Hypoxia is an 
important feature of several diseases, and genes that respond to this stimulus are therefore implicated in 
the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products. The 
response of renal epithelial cells to hypoxia is pertinent to kidney failure, especially regarding the 
5 medullary tissue. TR1P-Br2 is preferentially induced by hypoxia in renal epithelial cells. Macrophages are 
key to several diseases involving hypoxia, and contribute to inflammatory processes. In these, 
macrophages are frequently activated by cytokines, which have been shown to be present at disease sites, 
so gene expression responses to both hypoxia and cytokines are especially relevant. TRIP-Br2 is 
repressed in macrophages activated by LPS and gamma interferon. 

10 The Oxford BioMedica clone plD16 represents Hypothetical protein FLJ20308. The protein sequence 
encoded by Hypothetical protein FLJ20308 is represented in the public databases by the accession 
XP_039852 and is described in this patent by Seq ID 33. The nucleotide sequence is represented in the 
public sequence databases by the accession AK000315 and is described in this patent by Seq ID 34. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 

15 implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory 
processes. In these, macrophages are frequently activated by cytokines, which have been shown to be 
present at disease sites, so gene expression responses to both hypoxia and cytokines are especially 
relevant. Hypothetical protein FLJ20308 is repressed in macrophages activated by LPS and gamma 

20 interferon. We expect it to have an anti-inflammatory role. 

The Oxford BioMedica clone pi J13 represents Hypothetical nuclear factor SBB122. The protein sequence 
encoded by Hypothetical nuclear factor SBBI22 is represented in the public databases by the accession 
NP_065128 and is described in this patent by Seq ID 35. The nucleotide sequence is represented in the 
public sequence databases by the accession NM_020395 and is described in this patent by Seq ID 36. 
25 Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. 

The Oxford BioMedica clone pi 122 represents Hypothetical protein KIAA1429. The protein sequence 
encoded by Hypothetical protein KIAA1429 is represented in the public databases by the accession 
30 BAA92667 and is described in this patent by Seq ID 37. The nucleotide sequence is represented in the 
public sequence databases by the accession AB037850 and is described in this patent by Seq ID 38. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. 
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The Oxford BioMedica clone plJ6 represents Hypothetical protein FLJ10206. The protein sequence 
encoded by Hypothetical protein FLJ10206 is represented in the public databases by the accession 
AAH06108 and is described in this patent by Seq ID 39. The nucleotide sequence is represented in the 
public sequence databases by the accession NM_01 8025 and is described in this patent by Seq ID 40. 
5 Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory 
processes. In these, macrophages are frequently activated by cytokines, which have been shown to be 
present at disease sites, so gene expression responses to both hypoxia and cytokines are especially 
10 relevant. Hypothetical protein FLJ10206 is repressed in macrophages activated by IL-17 and is also 
repressed in macrophages activated by IL-15. These are pro-inflammatory cytokines, and we expect the 
hypothetical protein FLJ1 0206 to have an anti-inflammatory role. 

The Oxford BioMedica clone p 1 15 represents Hypothetical protein FLJ10815. The protein sequence 
encoded by Hypothetical protein FLJ10815 is represented in the public databases by the accession 

15 BAA91830 and is described in this patent by Seq ID 41. The nucleotide sequence is represented in the 
public sequence databases by the accession NM_01 823 1 and is described in this patent by Seq ID 42. 
Hypothetical protein FLJ10815 is structurally similar to an alpha / beta barrel structure. Hypoxia is an 
important feature of several diseases, and genes that respond to this stimulus are therefore implicated in 
the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products. 

20 Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory processes. In 
these, macrophages are frequently activated by cytokines, which have been shown to be present at disease 
sites, so gene expression responses to both hypoxia and cytokines are especially relevant. Hypothetical 
protein FLJ10815 is repressed in macrophages activated by LPS and gamma interferon. We expect it to 
have an anti-inflammatory role. 

25 The Oxford BioMedica clone p 1 1 1 3 represents Hypothetical protein FLJ11100. The protein sequence 
encoded by Hypothetical protein FLJI1100 is represented in the public databases by the accession 
NPJ6070I and is described in this patent by Seq ID 43. The nucleotide sequence is represented in the 
public sequence databases by the accession NM_018321 and is described in this patent by Seq ID 44. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 

30 implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic . 
products. 

The Oxford BioMedica clone pi 117 represents Hypothetical protein FLJ20644. The protein sequence 
encoded by Hypothetical protein FLJ20644 is represented in the public databases by the accession 
NP_060387 and is described in this patent by Seq ID 45. Hypothetical protein FLJ20644 is a putative 
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Serine/threonine phosphotase. Region 250 - 450 shows high structural similarity to other Serine/threonine 
phosphotases. The nucleotide sequence is represented in the public sequence databases by the accession 
NM_0I79I7 and is described in this patent by Seq ID 46. Hypoxia is an important feature of several 
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 
5 utility in the design of therapeutic, prognostic and diagnostic products. 

The Oxford BioMedica clone p 1 1 1 5 represents Hypothetical protein CGI-117. The protein sequence 
encoded by Hypothetical protein CGI- 1 1 7 is represented in the public databases by the accession 
Q9Y3C1 and is described in this patent by Seq ID 47. The nucleotide sequence is represented in the 
public sequence databases by the accession NM_016391 and is described in this patent by Seq ID 48. 

10 Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. HIF1 alpha and EPAS1 are transcription factors that mediate the response to hypoxia of several 
genes, and have them selves been implicated in specific diseases. By adenoviral over-expression of 
HIF1 alpha or EPAS1 we show augmentation of the hypoxic induction of certain genes, further 

15 confirming their status as responsive to hypoxia. Hypothetical protein CGI-117 has been shown to be 
induced by hypoxia to a greater degree following adenoviral over-expression of either H IF 1 alpha or 
EPAS1. Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory 
processes. In these, macrophages are frequently activated by cytokines, which have been shown to be 
present at disease sites, so gene expression responses to both hypoxia and cytokines are especially 

20 relevant. Hypothetical protein CGH17 is repressed in macrophages activated by LPS and gamma 
interferon. We expect it to have an anti-inflammatory role. 

The Oxford BioMedica clone pi 17 represents Uridine 5' monophosphate hydrolase 1. The protein 
sequence encoded by Uridine 5' monophosphate hydrolase I is represented in the public databases by the 
accession NP_057573 and is described in this patent by Seq ID 49. The nucleotide sequence is 

25 represented in the public sequence databases by the accession NM_OI 6489 and is described in this patent 
by Seq ID 50. Hypoxia is an important feature of several diseases, and genes that respond to this stimulus 
are therefore implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and 
diagnostic products. Macrophages are key to several diseases involving hypoxia, and contribute to 
inflammatory processes. In these, macrophages are frequently activated by cytokines, which have been 

30 shown to be present at disease sites, so gene expression responses to both hypoxia and cytokines are 
especially relevant. Uridine 5' monophosphate hydrolase I is induced in macrophages activated by LPS 
and gamma interferon and is also is induced in macrophages activated by IL-15. We expect it to have a 
pro-inflammatory role, and its inhibition may have an anti-inflammatory effect. 
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The protein sequence encoded by Hypothetical protein K1AA00I4 is represented in the public databases 
by the accession NPJ)55480 and is described in this patent by Seq ID 51. The nucleotide sequence is 
represented in the public sequence databases by the accession NM_014665 and is described in this patent 
by Seq ID 52. Hypoxia is an important feature of several diseases, and genes that respond to this stimulus 
5 are therefore implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and 
diagnostic products. 

The Oxford BioMedica clone pi 14 represents Hypothetical protein HSPC196. The protein sequence 
encoded by Hypothetical protein HSPC196 is represented in the public databases by the accession 
NP_057548 and is described in this patent by Seq ID 53. The nucleotide sequence is represented in the 

10 public sequence databases by the accession NM_0I6464 and is described in this patent by Seq ID 54. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory 
processes. In these, macrophages are frequently activated by cytokines, which have been shown to be 

15 present at disease sites, so gene expression responses to both hypoxia and cytokines are especially 
relevant. Hypothetical protein HSPC196 is repressed in macrophages activated by LPS and gamma 
interferon. We expect it to have an anti-inflammatory role. 

The Oxford BioMedica clone p 1 18 represents Hypothetical protein FLJII296. The protein sequence 
encoded by Hypothetical protein FLJ11296 is represented in the public databases by the accession 
20 XP_004747 and is described in this patent by Seq ID 55. The nucleotide sequence is represented in the 
public sequence databases by the accession NM_018384 and is described in this patent by Seq ID 56. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. 

25 The Oxford BioMedica clone pi 116 represents Hypothetical protein KIAA1668. The protein sequence 
encoded by Hypothetical protein KIA A 1 668 is represented in the public databases by the accession 
BAB33338 and is described in this patent by Seq ID 57. The nucleotide sequence is represented in the 
public sequence databases by the accession AB051455 and is described in this patent by Seq ID 58. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 

30 implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. 

The Oxford BioMedica clone pllll represents SECIS binding protein 2. The protein sequence encoded 
by SECIS binding protein 2 is represented in the public databases by the accession AAK57518 and is 
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described in this patent by Seq ID 59. The nucleotide sequence is represented in the public sequence 
databases by the accession AF380995 and is described in this patent by Seq ID 60. SECIS binding protein 
2 is a crucial component in the complex required for the translation of mammalian selenoprotein mRNAs. 
Selenoproteins are important responders to redox conditions and many selenoproteins are known to 
5 protect from cell death. Our demonstration of the hypoxia induction of SECIS binding protein 2 opens 
new avenues for diagnosis and therapeutic intervention. Hypoxia is an important feature of several 
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 
utility in the design of therapeutic, prognostic and diagnostic products. Hypoxia is frequently found in 
human tumours where macrophage infiltrates are also found. In a series of 5 patients with either ovarian 
10 or breast cancer, SECIS binding protein 2 is down-regulated in the malignant tissue as compared to 
adjacent normal lissue in at least one patient. We expect increased activity of the gene product to have an 
anti-tumour effect. 

The Oxford BioMedica clone plE8 represents cDNA: FLJ22249 fis, clone HRC02674. The sequence 
cDNA: FLJ22249 fis, clone HRC02674 is not represented in the public databases by a protein accession. 
15 The nucleotide sequence is represented in the public sequence databases by the accession AK025902 and 
is described in this patent by Seq ID 62. Hypoxia is an important feature of several diseases, and genes 
that respond to this stimulus are therefore implicated in the pathogenesis and have utility in the design of 
therapeutic, prognostic and diagnostic products. 

The Oxford BioMedica clone plE18 represents Plexin CI. The protein sequence encoded by Plexin CI is 
20 represented in the public databases by the accession NPJI05752 and is described in this patent by Seq ID 
63. The nucleotide sequence is represented in the public sequence databases by the accession 
NMJ)05761 and is described in this patent by Seq ID 64. Plexins are a large family of receptors for 
transmembrane, secreted, and GPI-anchored semaphores in vertebrates and play a significant role in 
signal transduction [Tamagnone et al 1999, Cell 99:71-80]. Elsewhere in this patent we disclose hypoxic 
25 regulation of a new semaphorin 4b, and we propose co-regulation of these molecules by hypoxia and their 
relevance to inflammatory disease, its diagnosis and therapy. Hypoxia is an important feature of several 
diseases, and genes that. respond to this stimulus are therefore implicated in the pathogenesis and have 
utility in the design of therapeutic, prognostic and diagnostic products. Macrophages are key to several 
diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are 
30 frequently activated by cytokines, which have been shown to be present at disease sites, so gene 
expression responses to both hypoxia and cytokines are especially relevant. Plexin CI is repressed in 
macrophages activated by LPS and gamma interferon. 

The Oxford BioMedica clone plE16 represents cDNA DKFZp586E1624. The sequence cDNA 
DKFZp586EI624 is not represented in the public databases by a protein accession. The nucleotide 
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sequence is represented in the public sequence databases by the accession ALI 10152 and is described in 
this patent by Seq ID 66. Hypoxia is an important feature of several diseases, and genes that respond to 
this stimulus are therefore implicated in the pathogenesis and have utility in the design of therapeutic, 
prognostic and diagnostic products. HlFlalpha and EPAS1 are transcription factors that mediate the 
5 response to hypoxia of several genes, and have them selves been implicated in specific. diseases. By 
adenoviral over-expression of EPAS1 we show augmentation of the hypoxic induction of certain genes, 
further confirming their status as responsive to hypoxia. Its preferential regulation by EPAS1 provides a 
route to preferential intervention, to avoid toxicity to other tissues. The cDNA DKFZp586El 624 has been 
shown to be induced by hypoxia to a greater degree following adenoviral over-expression of EPAS1. 
10 Endothelial cells are key to angiogenesis, a process implicated in several diseases associated with 
hypoxia, including cancer and rheumatoid arthritis. The cDNA DKFZp586E1624 is preferentially 
jnduced by hypoxia in endothelial cells. We expect this gene product to have a pro-angiogenic effect, and 
its inhibition to have an anti-angiogenic effect. 

The Oxford BioMedica clones plD5 and plD6 represent ER01 (S. cerevisiae)-like. The protein sequence 

15 encoded by ER01 (S. cerevisiae)-like is represented in the public databases by the accession NP_055399 
and is described in this patent by Seq ID 67. The nucleotide sequence is represented in the public 
sequence databases by the accession NM_014584 and is described in this patent by Seq ID 68. EROI (S. 
cerevisiae)-Iike has been shown to be a flavin adenine dinucleotide (FAD) binding protein. Binding of 
FAD enables ERQ1 (S. cerevisiae)-like to oxidise protein disulfide isomerase (PDI). We propose that the 

20 oxidisation of PDI by ERO 1 (S. cerevisiae)-like stops PDI autodegradation, therefore increasing levels of 
the protein. Increased levels of PDI have been shown to be neuroprotective by inhibiting apoptotic cell 
death. Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are 
therefore implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and 
diagnostic products. HlFlalpha and EPAS1 are transcription factors that mediate the response to hypoxia 

25 of several genes, and have them selves been implicated in specific diseases. By adenoviral over- 
expression of EPAS1 we show augmentation of the hypoxic induction of certain genes, further 
confirming their status as responsive to hypoxia. EROI (S. cerevisiae)-like has been shown to be induced 
by hypoxia to a greater degree following adenoviral over-expression of EPAS1. Its preferential regulation 
by EPAS1 provides a route to preferential intervention, to avoid toxicity to other tissues. EROI (S. 

30 cerevisiae)-like is preferentially induced by hypoxia in mammary epithelial cells. Macrophages are key to 
several diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are 
frequently activated by cytokines, which have been shown to be present at disease sites, so gene 
expression responses to both hypoxia and cytokines are especially relevant. EROI (S. cerevisiae)-like is 
repressed in macrophages activated by LPS and gamma interferon. Hypoxia is frequently found in human 
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tumours where macrophage infiltrates are also found. In a series of 5 patients with either ovarian or breast 
cancer, ER01 (S. cerevisiae)-Iike is up-regulated in the malignant tissue as compared to adjacent normal 
tissue in at least one patient. 

The Oxford BioMedica clone plEI2 represents Hypothetical protein DKFZP434E1723. The protein 
5 sequence encoded by Hypothetical protein DKFZP434E1723 is represented in the public databases by the 
accession XP_05338 and is described in this patent by Seq ID 69. The nucleotide sequence is represented 
in the public sequence databases by the accession BC0I0005 and is described in this patent by Seq ID 70. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
10 products. 

The Oxford BioMedica clone plElO represents cDNA FLJ11041 fis clone PLACE1004405. The 
sequence encoded by cDNA FLJ1I04I fis, clone PLACE1004405 is not represented in the public 
databases by a protein accession. The nucleotide sequence is represented in the public sequence databases 
by the accession AK001903 and is described in this patent by Seq ID 72. Hypoxia is an important feature 

15 of several diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis 
and have utility in the design of therapeutic, prognostic and diagnostic products. Macrophages are key to 
several diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are 
frequently activated by cytokines, which have been shown to be present at disease sites, so gene 
expression responses to both hypoxia and cytokines are especially relevant. The cDNA FLJ1104I fis 

20 clone PLACE1 004405 is induced in macrophages activated by LPS and gamma interferon. We expect it 
to have a pro-inflammatory role, and its inhibition may have an anti-inflammatory effect. 

The Oxford BioMedica clone p I C2 1 represents Tubulin, beta, 4. The protein sequence encoded by 
Tubulin, beta, 4 is represented in the public databases by the accession NP_006077 and is described in 
this patent by Seq ID 73. The nucleotide sequence is represented in the public sequence databases by the 
25 accession NM_006086 and is described in this patent by Seq ID 74. Hypoxia is an important feature of 
several diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and 
have utility in the design of therapeutic, prognostic and diagnostic products. 

The Oxford BioMedica clone pIDIO represents Insulin induced protein 2. The protein sequence encoded 
by Insulin induced protein 2 is represented in the public databases by the accession AAD43048 and is 
30 described in this patent by Seq ID 75. The nucleotide sequence is represented in the public sequence 
databases by the accession AF1 25392 and is described in this patent by Seq ID 76. Hypoxia is an 
important feature of several diseases, and genes that respond to this stimulus are therefore implicated in 
the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products. 



WO 02/46465 



PCT/GB01/05458 



120 

Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory processes. In 
these, macrophages are frequently activated by cytokines, which have been shown to be present at disease 
sites, so gene expression responses to both hypoxia and cytokines are especially relevant. Insulin induced 
protein 2 is induced in macrophages activated by LPS and gamma interferon. We expect it to have a pro- 
5 inflammatory ro!e,.and its inhibition may have an anti-inflammatory effect. 

The Oxford BioMedica clones p 1 D 1 3 and plA22 represent Adenylate kinase 3. The protein sequence 
encoded by Adenylate kinase 3 is represented in the public databases by the accession NP_037542 and is 
described in this patent by Seq ID 77 and 263. The nucleotide sequence is represented in the public 
sequence databases by the accession N M _0 1 34 1 0 and is described in this patent by Seq ID 78 and 264. 

10 Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory 
processes. In these, macrophages are frequently activated by cytokines, which have been shown to be 
present at disease sites, so gene expression responses to both hypoxia and cytokines are especially 

15 relevant. Adenylate kinase 3 is induced in macrophages activated by LPS and gamma interferon. 
TNFalpha is an inflammatory cytokine, which acts on macrophages, and has been shown to be central to 
the pathophysiology and treatment of diseases including rheumatoid arthritis. Genes that change in 
expression in response to TNFalpha therefore have utility in the design of therapeutic, prognostic and 
diagnostic products for such inflammatory conditions. Adenylate kinase 3 is induced in macrophages 

20 activated by TNFalpha. 

The Oxford BioMedica clone pi E9 represents a novel P3-3-kinase adapter. The protein sequence encoded 
by the novel PI-3-kinase adapter is not represented in the public databases by a protein accession but is 
described in this patent by Seq ID 79. The nucleotide sequence of an unannotated EST corresponding to 
the novel PI-3-kinase adapter is represented in the public sequence databases by the accession R62339 

25 and is described in this patent by Seq ID 80. Hypoxia is an important feature of several diseases, and 
genes that respond to this stimulus are therefore implicated in the pathogenesis and have utility in the 
design of therapeutic, prognostic and diagnostic products. There is a prejudice in the art that the response 
to hypoxia is generic to all cell types. Contrary to this, we show that genes are regulated by hypoxia to a 
greater degree in certain cell types, substantiating their utility in designing specific therapeutic products 

30 for diseases involving those cell types. Monocytes and macrophages have been implicated in the 
following diseases involving hypoxia: rheumatoid arthritis, atherosclerosis, cancer, COPD and peripheral 
arterial disease. The novel PI-3-kinase adapter is preferentially induced by hypoxia in monocytes or 
macrophages, indicating utility of the encoded protein in the design of therapeutic, prognostic and 
diagnostic products addressing diseases involving macrophages and hypoxia. In a gene array analysis it is 



WO 02/46465 



PCT/GBI1 1/05458 



121 

expressed in- hypoxic monocytes and macrophages at levels 6-fold higher than the median expression 
level of this gene throughout 9 other cell types in either normoxia or hypoxia. In more sensitive TaqMan 
analysis the novel PI-3-kinase adapter it is found to be expressed at approximately 1000 times the levels 
of 9 other cell types, all exposed to hypoxia for 18hr. The relevance of the novel PI-3-kinase adapter to 
5 human disease is also appreciated from comparison with a related murine gene, BCAP. It is known that 
this gene is phosphorylated by the tyrosine kinase, Syk. We also show novel data regarding Syk, in that it 
is also induced in response to hypoxia in a tissue specific manner identical to that of the novel PI-3-kinase 
adapter. Therefore the biological relevance and utility of our discovery of hypoxic induction of the novel 
PI-3-kinase adapter gene is further highlighted. 

10 

The Oxford BioMedica clone plFl represents an unannotated EST. The protein sequence encoded by this 
EST is not represented in the public databases by a protein accession. The nucleotide sequence is 
represented in the public sequence databases by the accession AA489477 and is described in this patent 
by Seq ID 82. Hypoxia is an important feature of several diseases, and genes that respond to this stimulus 
15 are therefore implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and 
diagnostic products. 

The Oxford BioMedica clone plE7 represents a novel. Metallothionein. The protein sequence encoded by 
Novel Metallothionein is not represented in the public databases by a protein accession but is described in 
this patent by Seq ID 83. The nucleotide sequence is represented in the public sequence databases by the 

20 accession R06601 and is described in this patent by Seq ID 84. Metallothioneins can act as an antioxidant 
and free-radical scavenger and are therefore protective against cell death in hypoxia. Hypoxia is an 
important feature of several diseases, and genes that respond to this stimulus are therefore implicated in 
the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products. 
HIF1 alpha and EPAS1 are transcription factors that mediate the response to hypoxia of several genes, and 

25 have them selves been implicated in specific diseases. By adenoviral over-expression of HIFI alpha we 
show augmentation of the hypoxic induction of certain genes, further confirming their status as 
responsive to hypoxia. The novel Metallothionein represented by Seq ID 84 has been shown to be 
induced by hypoxia to a greater degree following adenoviral over-expression of HIFlalpha. Hepatocytes 
are the main cell type of the liver and genes that are induced in response to hypoxia in this cell type are 

30 relevant to development of diagnostics and therapeutics towards liver diseases involving hypoxia, 
including cirrhosis. The novel Metallothionein represented by Seq ID 84 is preferentially induced by 
hypoxia in hepatocytes. Macrophages are key to several diseases involving hypoxia, and contribute to 
inflammatory processes. In these, macrophages are frequently activated by cytokines, which have been 
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shown -to be present at disease sites, so gene expression responses to both hypoxia and cytokines are 
especially relevant. The novel Metallothionein represented by Seq ID 84 is induced in macrophages 
activated by LPS and gamma interferon., 

The Oxford BioMedica clone plE6 represents EGL nine (C.elegans) homolog 3. The protein sequence 
5 encoded by EGL nine (C.elegans) homolog 3 is represented in the public databases by the accession 
NP_071356 and is described in this patent by Seq ID 85. The nucleotide sequence is represented in the 
public sequence databases by the accession NM_022073 and is described in this patent by Seq ID 86. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 

10 products. HIF1 alpha and EPAS1 are transcription factors that mediate the response to hypoxia of several 
genes, and have them selves been implicated in specific diseases. By adenoviral over-expression of 
EPAS1 we show augmentation of. the hypoxic induction of certain genes, further confirming their status 
as responsive to hypoxia. EGL nine (C.elegans) homolog 3 has been shown to be induced by hypoxia to a 
greater degree following adenoviral over-expression of EPAS1. Its preferential regulation by EPAS1 

15 provides a route to preferential intervention, to avoid toxicity to other tissues. Hepatocytes are the main 
cell type of the liver and genes that are induced in response to hypoxia in this cell type are relevant to 
development of diagnostics and therapeutics towards liver diseases involving hypoxia, including 
cirrhosis. EGL nine (C.elegans) homolog 3 is preferentially induced by hypoxia in hepatocytes. We find 
that EGLN3 and a related human gene C 1 orfl 2 (seq ID 89/90) both of which are predicted to be proline 

20 hydroxylases, are expressed at differing absolute expression levels in different tissues. For instance, in the 
hypoxic hepatocyte (6hr) the normalised expression values of EGLN and clorfl 2 are 0.015 and 0.0074 
respectively, i.e. EGLN being the dominant gene. In contrast, in the neuroblastoma cell line SH-SY5Y, 
the normalised expression values of EGLN and clorfl 2 after 6hr hypoxia are 0.0012 and 0.108 
respectively, i.e. clorfI2 being the dominant gene by a large margin. This data demonstrates that 

25 cIORFI2 and EGLN3 are not constitulively expressed at an equal amount in different tissues indicating 
specificity of function. Therefore therapeutic products may be developed based on this data, with the goal 
of modulating proline hydroxylation of target proteins (such as HIF1 alpha) in specific tissues, based on 
the differing expression profile of clORF12 and EGLN3 in those tissues. 

The Oxford BioMedica clone plD14 represents Clorfl 2. The protein sequence encoded by Clorfl2 is- 
30 represented in the public databases by the accession NP_071334 and is described in this patent by Seq ID 
89. The nucleotide sequence is represented in the public sequence databases by the accession 
NM J)22051 and is described in this patent by Seq ID 90. Hypoxia is an important feature of several 
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 
utility in the design of therapeutic, prognostic and diagnostic products. We find that Clorfl 2 and a related 
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human gene EGLN3 (seq ID 85/86) both of which are predicted to be proline hydroxylases, are expressed 
at differing absolute expression levels in different tissues. For instance, in the hypoxic hepatocyte (6hr) 
the normalised expression values of EGLN and cl orf 1 2 are 0.015 and 0.0074 respectively, i.e. EGLN 
being the dominant gene. In contrast, in the neuroblastoma cell line SH-SY5Y, the normalised expression 
5 values of EGLN and c I orf 1 2 after 6hr hypoxia are 0.0012 and 0.108 respectively, i.e. dorfl2 being the 
dominant gene by a large margin. This data demonstrates that clORF12 and EGLN3 are not 
constitutively expressed at an equal amount in different tissues indicating specificity of function. 
Therefore therapeutic products may be developed based on this data, with the goal of modulating proline 
hydroxylalion of target proteins (such as H IF1 alpha) in specific tissues, based on the differing expression 
10 profile of clORF12 and EGLN3 in those tissues. 

The Oxford BioMedica clone p2B 1 represents PRAME. The protein sequence encoded by PRAME is 
represented in the public databases by the accession NP_006106 and is described in this patent by Seq ID 
87. The nucleotide sequence is represented in the public sequence databases by the accession 
NM_0061I5 and is described in this patent by Seq ID 88. Hypoxia is an important feature of several 

15 diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 
utility in the design of therapeutic, prognostic and diagnostic products. Hypoxia is frequently found in 
human tumours where macrophage infiltrates are also found. In a series of 5 patients with either ovarian 
or breast cancer, PRAME is up-regulated in the malignant tissue as compared to adjacent normal tissue in 
at least one patient. PRAME. is a well-known tumour-associated antigen. Our surprising demonstration of 

20 its hypoxia-regulation provides for an important diagnostic test to distinguish false-positive results. In 
addition, we show the relevance of PRAME to hypoxia-related functions of tumours such as 
angiogenesis. 

The Oxford BioMedica clones plD17 and plP14 represent Semaphorin 4b. The protein sequence 
encoded by Semaphorin 4b is represented in the public databases by the accession BAB 2 1 836 and is 

25 described in this patent by Seq ID 91. The nucleotide sequence is represented in the public sequence 
databases by the accession AB051532 and is described in this patent by Seq ID 92. Hypoxia is an 
important feature of several diseases, and genes that respond to this stimulus are therefore implicated in 
the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products. The 
respojise of renal epithelial cells to hypoxia is pertinent to kidney failure, especially regarding the 

30 medullary tissue. Semaphorin 4b is preferentially induced by hypoxia in renal epithelial cells. 
Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory processes. In 
these, macrophages are frequently activated by cytokines, which have been shown to be present at disease 
sites, so gene expression responses to both hypoxia and cytokines are especially relevant. Semaphorin 4b 
is induced in macrophages activated by LPS and gamma interferon.. Semaphorin 4b is also induced by 
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the the presence of reactive oxygen species. We expect it to have a pro-inflammatory role, and its 
inhibition may have an anti-inflammatory effect. We have cited elsewhere in this specification that a 
plexin is hypoxia-reguiated, and we propose a functional relationship between these two molecules. 
Hypoxia is frequently found in human tumours where macrophage infiltrates are also found. In a series of 
5 5 patients with either ovarian or breast cancer, Semaphorin 4b is up-regulated in the malignant tissue as 
compared to adjacent normal tissue in at least one patient. Semaphorin 4b is also induced in response to 
superoxide radicals, as found in various disease states, implying utility. Semaphorin 4b is predicted to 
function in modulating several cellular processes key (o human disease, including angiogenesis, 
inflammation, immune cell migration and tissue remodelling. Other Semaphores including Semaphorin 
10 E, which are induced in response to hypoxia will also be implicated in these disease processes and have 
utility as described for Semaphorin 4b. 

The Oxford BioMedica clone plC24 represents SLC25AI9. The protein sequence encoded by SLC25AI9 
is represented in the public databases by the accession NP_06838O and is described in this patent by Seq 
ID 93. The nucleotide sequence is represented in the public sequence databases by the accession 
15 NM_021734 and is described in this patent by Seq ID 94. SLC25A19 transports deoxynucleotides into 
mitochondria and is therefore essential for mtDNA synthesis. Hypoxia is an important feature of several 
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 
utility in the design of therapeutic, prognostic and diagnostic products. 

The Oxford BioMedica clone plD3 represents Serine carboxypeptidase I. The protein sequence encoded 
20 by Serine carboxypeptidase 1 is represented in the public databases by the accession NP_067639 and is 
described in this patent by Seq ID 95. The nucleotide sequence is represented in the public sequence 
databases by the accession NM_021626 and is described in this patent by Seq ID 96. Hypoxia is an 
important feature of several diseases, and genes that respond to this stimulus are therefore implicated in 
the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products. 
25 Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory processes. In 
these, macrophages are frequently activated by cytokines, which have been shown to be present at disease 
sites, so gene expression responses to both hypoxia and cytokines are especially relevant. Serine 
carboxypeptidase 1 is repressed in macrophages activated by LPS and gamma interferon. TNFalpha is an 
inflammatory cytokine, which acts on macrophages, and has been shown to be central to the 
30 pathophysiology and treatment of diseases including rheumatoid arthritis. Genes that change in 
expression in response to TNFalpha therefore have utility in the design of therapeutic, prognostic and 
diagnostic products for such inflammatory conditions. Serine carboxypeptidase 1 is induced in 
macrophages activated by TNFalpha. Increased serine carboxypeptidase activity in glial cells has been 
shown to result in neurological abnormalities, due to the degradation of essential neuro-active factors. 
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Similarly, peripheral neurological disease could result from such activity in macrophages. Our 
demonstration of hypoxia regulation of serine carboxypeptidase activity opens a route for diagnosis and 
treatment of these diseases. Hypoxia is frequently found in human tumours where macrophage infiltrates 
are also found. In a series of 5 patients with either ovarian or breast cancer, Serine carboxypeptidase I is 
5 down-regulated in the malignant tissue as compared to adjacent normal tissue in at least one patient. 

The Oxford BioMedica clone p 1 E 1 4 represents an unknown mRNA (schizophrenia-linked). The protein 
sequence encoded by the unknown mRNA (schizophrenia-linked) is not represented in the public 
databases by a protein accession. The nucleotide sequence is represented in the public sequence databases 
by the accession AY0101 12 and is described in this patent by Seq ID 98. Hypoxia is an important feature 

10 of several diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis 
and have utility in the design of therapeutic, prognostic and diagnostic products. Macrophages are key to 
several diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are 
frequently activated by cytokines, which have been shown to be present at disease sites, so gene 
expression responses to both hypoxia and cytokines are especially relevant. TNFalpha is an inflammatory 

15 cytokine, which acts on macrophages, and has been shown to be central to the pathophysiology and 
treatment of diseases including rheumatoid arthritis. Genes that change in expression in response to 
TNFalpha therefore have utility in the design of therapeutic, prognostic and diagnostic products for such 
inflammatory conditions. Unknown mRNA (schizophrenia-linked) is induced in macrophages activated 
by TNFalpha. There are many enzymic activities that can give rise to neurological abnormalities, and 

20 their hypoxia regulation is pertinent to the diagnosis and treatment of such diseases, including 
schizophrenia. 

The Oxford BioMedica clone pi E20 represents Myo-inositol monophosphatase A3. The protein sequence 
encoded by Myo-inositol monophosphatase A3 is represented in the public databases by the accession 
AAK52336 and is described in this patent by Seq ID 99. The nucleotide sequence is represented in the 

25 public sequence databases by the accession NM_017813 and is described in this patent by Seq ID 100. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. As refered to elsewhere in this specification, we have found several components of the 
phosphatidylinisotol second messenger system to be hypoxia-regulated. This system has profound effects 

30 which are relevant to many diseases with known associations with hypoxia and ischaemia. Local and 
transient ischaemia is relevant to such diseases as rheumatoid arthritis and atherosclerosis, and also 
potentially to such diseases as schizophrenia and bi-polar disorder. It is instructive that lithium, which is a 
well-recognised treatment for affective disorders, appears to operate via the phosphatidylinisotol systtm 
[Pettegrew et al 2001, Bipolar Disord 3:189-201]. Macrophages are key to several diseases involving 
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hypoxia, and contribute to inflammatory processes. In these, macrophages are frequently activated by 
cytokines, which have been shown to be present at disease sites, so gene expression responses to both 
hypoxia and cytokines are especially relevant. Myo-inositol monophosphatase A3 is repressed in 
macrophages activated by LPS and gamma interferon. 

5 The Oxford BioMedica clone p2A24 represents an unannotated EST. The protein sequence encoded by 
this EST is not represented in the public databases by a protein accession. The nucleotide sequence is 
represented in the public sequence databases by the accession AA521314 and is described in this patent 
by Seq ID 102. Hypoxia is an important feature of several diseases, and genes that respond to this 
stimulus are therefore implicated in the pathogenesis and have utility in the design of therapeutic, 
10 prognostic and diagnostic products. 

The Oxford BioMedica clone pi El 7 represents Hypothetical protein FLJ31668. The protein sequence 
encoded by Hypothetical protein FLJ31668 is represented in the public databases by the accession 
BAB71124 and is described in this patent by Seq ID 103. The nucleotide sequence is represented in the 
public sequence databases by the accession AK056230 and is described in this patent by Seq ID 104. 
15 Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. . 

The Oxford BioMedica clone p 1 E 1 9 represents an unannotated EST. The protein sequence encoded by 
this EST is not represented in the public databases by a protein accession. The nucleotide sequence is 

20 represented in the public sequence databases by the accession R51835 and is described in this patent by 
Seq ID 106. Hypoxia is an important feature of several diseases, and genes that respond to this stimulus 
are therefore implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and 
diagnostic products. Hypoxia is frequently found in human tumours where macrophage infiltrates are also 
found. In a series of 5 patients with either ovarian or breast cancer, the EST represented by Seq ID 106 is 

25 up-regulated in the malignant iissue as compared to adjacent normal tissue in at least one patient. 

The Oxford BioMedica clone p 1 E 1 5 represents cDNA YI27F12. The protein sequence encoded by cDNA 
YI27F12 is not represented in the public databases by a protein accession. The nucleotide sequence is 
represented in the public sequence databases by the accession AF075018 and is described in this patent 
by Seq ID 108. Hypoxia is an important feature of several diseases, and genes that respond to this 
30 stimulus are therefore implicated in the pathogenesis and have utility in the design of therapeutic, 
prognostic and diagnostic products. The cDNA YI27F12 is induced in macrophages treated with the 
inhibitory cytokine 1L-10. The cDNA YI27F12 is repressed in macrophages activated by IL-17. We 
expect the product of cDNA YI27F12 to have an anti-inflammatory role. 
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The Oxford BioMedica clone plEll represents an unannotated EST. The protein sequence encoded by 
this EST is not represented in the public databases by a protein accession. The nucleotide sequence is 
represented in the public sequence databases by the accession R69248 and is described in this patent by 
Seq ID 1 10. Hypoxia is an important feature of several diseases, and genes that respond to this stimulus 
5 are therefore implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and 
diagnostic products. 

The Oxford BioMedica clone plE23 represents cDNA FLJ14041 fis, clone HEMBA1005780. The 
protein sequence encoded by cDNA FLJ14041 fis, clone HEMBAI005780 is not represented in the 
public databases by a protein accession. The nucleotide sequence is represented in the public sequence 
10 databases by the accession AK024103 and is described in this patent by Seq ID 112. Hypoxia is an 
important feature of several diseases, and genes that respond to this stimulus are therefore implicated in 
the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products. 

The Oxford BioMedica clone p)E21 represents Glutamate-cysteine ligase, modifier subunit. The protein 
sequence encoded by Glutamate-cysteine ligase, modifier subunit is represented in the public databases 

15 by the accession NP_002052 and is described in this patent by Seq ID 113. The nucleotide sequence is 
represented in the public sequence databases by the accession NM_002061 and is described in this patent 
by Seq ID 114. Hypoxia is an important feature of several diseases, and genes that respond to this 
stimulus are therefore implicated in the pathogenesis and have utility in the design of therapeutic, 
prognostic and diagnostic products. Glutamate-cysteine ligase is the rate-limiting enzyme of glutathione 

20 synthesis, and this enzyme is relevant to ceil survival under stress. Hypoxia is frequently found in human 
tumours where macrophage infiltrates are also found. In a series of 5 patients with either ovarian or breast 
cancer, Glutamate-cysteine ligase, modifier subunit is up-regulated in the malignant tissue as compared to 
adjacent normal tissue in at least one patient. 

The Oxford BioMedica clone plD23 represents PTEN. The protein sequence encoded by PTEN is 
25 represented in the public databases by the accession NP_000305 and is described in this patent by Seq ID 
1 15. The nucleotide sequence is represented in the public sequence databases by the accession 
NM J)00314 and is described in this patent by Seq ID 1 16. PTEN is a member of the mixed function, 
serine/threonine/tyrosine phosphatase subfamily of protein phosphatases. Its physiological substrates, 
however, are primarily 3-phosphorylated inositol phospholipids, which are products of phosphoinositide 
30 3-kinases [Downes et al 2001, Biochem Soc Trans 29:846-51]. Hypoxia-regulation of this gene is a 
further element in the hypoxic regulation of this important second messenger system. Hypoxia is an . 
important feature of several diseases, and genes that respond to this stimulus are therefore implicated in 
the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products. 
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The Oxford BioMedica clone plD24 represents an unannotated EST. The protein sequence encoded by 
this EST is not represented in the public databases by a protein accession. The nucleotide sequence is 
represented in the public sequence databases by the accession T73780 and is described in this patent by 
Seq ID 118. Hypoxia is an important feature of several diseases, and genes that respond to this stimulus 
5 are therefore implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and 
diagnostic products. The response of renal epithelial cells to hypoxia is pertinent to kidney failure, 
especially regarding the medullary tissue. The EST represented by Seq ID 118 is preferentially induced 
by hypoxia in renal epithelial cells. Macrophages are key to several diseases involving hypoxia, and 
contribute to inflammatory processes. In these, macrophages are frequently activated by cytokines, which 
10 have been shown to be present at disease sites, so gene expression responses to both hypoxia and 
cytokines are especially relevant. The EST represented by Seq ID 118 is induced in macrophages 
activated by LPS and gamma interferon. 

The Oxford BioMedica clones plD22 and plG5 represent MAX-interacting protein I. The protein 
sequence encoded by MAX-interacting protein 1 is represented in the public databases by the accession 

15 NP_005953 and is described in this patent by Seq ID 1 19 and 279. The nucleotide sequence is . 

represented in the public sequence databases by the accession NM_005962 and is described in this patent 
by Seq ID 120 and 280. MAX-interacting protein 1 is a negative regulator of myc oncoprotein with tumor 
suppressor properties, Hypoxia is an important feature of several diseases, and genes that respond to this 
stimulus are therefore implicated in the pathogenesis and have utility in the design of therapeutic, 

20 prognostic and diagnostic products. Macrophages are key to several diseases involving hypoxia, and 

contribute to inflammatory processes. In these, macrophages are frequently activated by cytokines, which 
have been shown to be present at disease sites, so gene expression responses to both hypoxia and 
cytokines are especially relevant. MAX-interacting protein 1 is repressed in macrophages activated by 
LPS and gamma interferon. 

25 The Oxford BioMedica clone plE2 represents Mannosidase, alpha, class 1A, member 1. The protein 
sequence encoded by Mannosidase, alpha, class 1A, member 1 is represented in the public databases by 
the accession NP_005898 and is described in this patent by Seq ID 121. The nucleotide sequence is 
represented in the public sequence databases by the accession NM.005907 and is described in this patent 
. by Seq ID 122. Hypoxia is an important feature of several diseases, and genes that respond to this 

30 stimulus are therefore implicated in the pathogenesis and have utility in the design of therapeutic, 
prognostic and diagnostic products. Hypoxia is frequently found in human tumours where macrophage 
infiltrates are also found. In a series of 5 patients with either ovarian or breast cancer, Mannosidase, 
alpha, class 1A, member I is down-regulated in the malignant tissue as compared to adjacent normal 
tissue in at least one patient. 
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The Oxford BioMedica clone pIEl represents an unannotated EST. The protein sequence encoded by this 
EST is not represented in the public databases by a protein accession. The nucleotide sequence is 
represented in the public sequence databases by the accession AA446361 and is described in this patent 
by Seq ID 124. Hypoxia is an important feature of several diseases, and genes that respond to this 
5 stimulus are therefore implicated in the pathogenesis and have utility in the design of therapeutic, 
prognostic and diagnostic products. Macrophages are key to several diseases involving hypoxia, and 
contribute to inflammatory processes. In these, macrophages are frequently activated by cytokines, which 
have been shown to be present at disease sites, so gene expression responses to both hypoxia and 
cytokines are especially relevant. The EST represented by Seq ID 124 is repressed in macrophages 
10 activated by LPS and gamma interferon. 

The Oxford BioMedica clone p 1 E4 represents an unannotated EST. The protein sequence encoded by this 
EST is not represented in the public databases by a protein accession. The nucleotide sequence is 
represented in the public sequence databases by the accession AA931411 and is described in this patent 
•by Seq ID 126. Hypoxia is an important feature of several diseases, and genes that respond to this 

15 stimulus are therefore implicated in the pathogenesis and have utility in the design of therapeutic, 
prognostic and diagnostic products. Macrophages are key to several diseases involving hypoxia, and 
contribute to inflammatory processes. In these, macrophages are frequently activated by cytokines, which 
have been shown to be present at disease sites, so gene expression responses to both hypoxia and 
cytokines are especially relevant. The EST represented by Seq ID 126 is repressed in macrophages 

20 activated by LPS and gamma interferon. We expect this gene product to have an anti-inflammatory role. 
Hypoxia is frequently found in human tumours where macrophage infiltrates are also found. In a series of 
5 patients with either ovarian or breast cancer, the EST represented by Seq ID 3 26 is down-regulated in 
the malignant tissue as compared to adjacent normal tissue in at least one patient. 

The Oxford BioMedica clone pi D 1 8 represents cDNA FLJ13443 fis, clone PLACE1002853. The protein 
25 sequence encoded by cDNA FLJ13443 fis, clone PLACEI002853 is not represented in the public 
databases by a protein accession. The nucleotide sequence is represented in the public sequence databases 
by the accession AK023505 and is described in this patent by Seq ID 128. Hypoxia is an important 
feature of several diseases, and genes that respond to this stimulus are therefore implicated in the 
pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products. 
30 Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory processes. In 
these, macrophages are frequently activated by cytokines, which have been shown to be present at disease 
• sites, so gene expression responses to both hypoxia and cytokines are especially relevant. The cDNA 
FLJI3443 fis, clone PLACE1002853 is repressed in macrophages activated by LPS and gamma 
interferon. We expect it to have an anti-inflammatory role. 
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The Oxford BioMedica clone pi D21 represents Hypothetical protein FLJ22622. The protein sequence 
encoded by Hypothetical protein FLJ22622 is represented in the public databases by the accession 
BAB 15424 and is described in this patent by Seq ID 129. The nucleotide sequence is represented in the 
public sequence databases by the accession NM_025151 and is described in this patent by Seq ID 130. 
5 Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. The response of renal epithelial cells to hypoxia is pertinent to kidney failure, especially 
regarding the medullary tissue. Hypothetical protein FLJ22622 is preferentially induced by hypoxia in 
renal epithelial cells. Macrophages are key to several diseases involving hypoxia, and contribute to 

10 inflammatory processes. In these, macrophages are frequently activated by cytokines, which have been 
shown to be present at disease sites, so gene expression responses to both hypoxia and cytokines are 
especially relevant. Hypothetical protein FLJ22622 is repressed in macrophages activated by LPS and 
gamma interferon. We expect it to have an anti-inflammatory role. Hypoxia is frequently found in human 
tumours where macrophage infiltrates are also found. In a series of 5 patients with either ovarian or breast 

15 cancer, Hypothetical protein FLJ22622 is up-regulated in the malignant tissue as compared to adjacent 
normal tissue in at least one patient. 

The Oxford BioMedica clone p?C22 represents CD84-HI. The protein sequence encoded by CD84-HI is 
represented in the public databases by the accession AAK69052 and is described in this patent by Seq ID 
131. The nucleotide sequence is represented in the public sequence databases by the accession AF275725 
20 and is described in this patent by Seq ID 132. Hypoxia is an important feature of several diseases, and 
genes that respond to this stimulus are therefore implicated in the pathogenesis and have utility in the 
design of therapeutic, prognostic and diagnostic products. 

The Oxford BioMedica clone plC23 represents Hypothetical protein FLJ12832. The protein sequence 
encoded by Hypothetical protein FLJ12832 is represented in the public databases by the accession 

25 XP_043394 and is described in this patent by Seq ID 133. The nucleotide sequence is represented in the 
public sequence databases by the accession AK022894 and is described in this patent by Seq ID 134. 
Hypothetical protien FLJ12832 is a putative ubiquitin as it shows high structural similarity to ubiquitin C 
and contains a ubiquitin domain. Hypoxia is an important feature of several diseases, and genes that 
respond to this stimulus are therefore implicated in the pathogenesis and have utility in the design of 

30 therapeutic, prognostic and diagnostic products. 

The Oxford BioMedica clone plDll represents an unannotated EST. The protein sequence encoded by 
this EST is not represented in the public databases by a protein accession. The nucleotide sequence is 
represented in the public sequence databases by the accession AA251748 and is described in this patent 
by Seq ID 136. Hypoxia is an important feature of several diseases, and genes that respond to this 
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stimulus are therefore implicated in the pathogenesis and have, utility in the design of therapeutic, 
prognostic and diagnostic products. 

The Oxford BioMedica clones plE3 and pi Fl 6 represent CYP1B1. The protein sequence encoded by 
CYPIB1 is represented in the public databases by the accession NP_G00095 and is described in this 
5 patent by Seq ID 137 and 325. The nucleotide sequence is represented in the public sequence databases 
by the accession NMJ00104 and is described in this patent by Seq ID 138 and 326. Hypoxia is an 
important feature of several diseases, and genes that respond to this stimulus are therefore implicated in 
the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products. There is 
a prejudice in the art that the response to hypoxia is generic to all cell types. Contrary to this, we show 

10' that genes are regulated by hypoxia to a greater degree in certain cell types, substantiating their utility in 
designing specific therapeutic products for diseases involving those cell types. Monocytes and 
macrophages have been implicated in the following diseases involving hypoxia: rheumatoid arthritis, 
atherosclerosis, cancer, COPD. CYP1B1 is preferentially induced by hypoxia in monocytes or 
macrophages and a restricted number of other cell types. Macrophages are key to several diseases 

15 involving hypoxia, and contribute to inflammatory processes. In these, macrophages are frequently 
activated by cytokines, which have been shown to be present at disease sites, so gene expression 
responses to both hypoxia and cytokines are especially relevant. CYPIB1 is repressed in macrophages 
activated by LPS and gamma interferon. TNFalpha is an inflammatory cytokine, which acts on 
macrophages, and has been shown to be central to the pathophysiology and treatment of diseases 

20 including rheumatoid arthritis. Genes that change in expression in response to TNFalpha therefore have 
utility in the design of therapeutic, prognostic and diagnostic products for such inflammatory conditions. 
CYP1B1 is induced in macrophages activated by TNFalpha. Hypoxia is frequently found in human 
tumours where macrophage infiltrates are also found. In a series of 5 patients with either ovarian or breast 
cancer, CYPlBlis up-regulated and also down regulated in the malignant tissue as compared to adjacent 

25 normal tissue in at least one patient. 

The Oxford BioMedica clone plD20 represents Hypothetical protein KIAA1125. The protein sequence 
encoded by Hypothetical protein KIAA1125 is represented in the public databases by the accession 
XP_012932 and is described in this patent by Seq ID 139. The nucleotide sequence is represented in the 
public sequence databases by the accession AB032951 and is described in this patent by Seq ID 140. 
30 Hypoxia is an important featu're of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. 

The Oxford BioMedica clone plE5 represents Hepcidin antimicrobial peptide. The protein sequence 
encoded by Hepcidin antimicrobial peptide is represented in the public databases by the accession 
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NPJJ66998 and is described in this patent by Seq ID 141. The nucleotide sequence is represented in the 
public sequence databases by the accession NMJ21I75 and is described in this patent by Seq ID 142. 
Hypoxia is an important feature. of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
5 products. Hepatocytes are the main cell type of the liver and genes that are induced in response to hypoxia 
in this cell type are relevant to development of diagnostics and therapeutics towards liver diseases 
involving hypoxia, including cirrhosis. Hepcidin antimicrobial peptide is preferentially induced by 
hypoxia in hepatocytes. Hepcidin antimicrobial peptide is induced in macrophages treated with the 
inhibitory cytokine IL-10. TNFalpha is an inflammatory cytokine, which acts on macrophages, and has 
10 been shown to be central to the pathophysiology and treatment of diseases including rheumatoid arthritis. 
Genes that change in expression in response to TNFalpha therefore have utility in the design of 
therapeutic, prognostic and diagnostic products for such inflammatory conditions. Hepcidin antimicrobial 
peptide is repressed in macrophages activated by TNFalpha. 

The Oxford BioMedica clone pi D 19 represents an unannotated EST. The protein sequence encoded by 
15 this EST is not represented in the public databases by a protein accession. The nucleotide sequence is 
represented in the public sequence databases by the accession R68736 and is described in this patent by 
Seq ID 144. Hypoxia is an important feature of several diseases, and genes that respond to this stimulus 
are therefore implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and 
diagnostic products. Macrophages are key to several diseases involving hypoxia, and contribute to 
20 inflammatory processes. In these, macrophages are frequently activated by cytokines, which have been 
shown to be present at disease sites, so gene expression responses to both hypoxia and cytokines are 
especially relevant. The EST represented by Seq ID 144 is induced in macrophages activated by LPS and 
gamma interferon and is also induced in macrophages activated by IL-15. We expect the gene product 
relevant to the EST represented by Seq ID 144 to have a pro-inflammatory role, and its inhibition may 
25 have an anti-inflammatory effect. 

The Oxford BioMedica clone p2A15 represents Sialyltransferase. The protein, sequence encoded by 
Sialyltransferase is represented in the public databases by the- accession NPJ)06447 and is described in 
this patent by Seq ID 145. The nucleotide sequence. is represented in the public sequence databases by the 
accession NM_006456 and is described in this patent by Seq ID 146. Hypoxia is an important feature of 
30 several diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and 
have utility in the design of therapeutic, prognostic and diagnostic products. 

The Oxford BioMedica clone p 1 1 1 4 represents cDNA DKFZp564D016. The protein sequence encoded by 
cDNA DKFZp564D016 is not represented in the public databases by a protein accession. The nucleotide 
sequence is represented in the public sequence databases by the accession AL050021 and is described in 
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this patent by Seq ID 148. Hypoxia is an important feature of several diseases, and genes that respond to 
this stimulus are therefore implicated in the pathogenesis and have utility in the design of therapeutic, 
prognostic and diagnostic products. 

The Oxford BioMedica clone p 1 12 represents cDNA FU1 1302 fis, clone PLACE100997I . The protein 
5 sequence encoded by cDNA FLJ1 1302 fis, clone PLACE1009971 is not represented in the public 
databases by a protein accession. The nucleotide sequence is represented in the public sequence databases 
by the accession AK002164 and is described in this patent by Seq ID 150. Hypoxia is an important 
feature of several diseases, and genes that respond to this stimulus are therefore implicated in the 
pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products. 
10 Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory processes. In 
these, macrophages are frequently activated by cytokines, which have been shown to be present at disease 
' sites, so gene expression responses to both hypoxia and cytokines are especially relevant. The cDNA 
FLJ11302 fis, clone PLACE1009971 is repressed in macrophages activated by LPS and gamma 
interferon. We expect it to have an anti-inflammatory role. 

15 The Oxford BioMedica clone p 1 1 1 2 represents Hypothetical protein MGC4549. The protein sequence 
encoded by Hypothetical protein MGC4549 is represented in the public databases by the accession 
XP_032794 and is described in this patent by Seq ID 151. The nucleotide sequence is represented in the 
public sequence databases by the accession NM_032377 and is described in this patent by Seq ID 152. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 

20 implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. Hypothetical protein MGC4549 is induced in macrophages treated with the inhibitory cytokine 
IL-10. Hypothetical protein MGC4549 is repressed in macrophages activated by IL-17 and is also 
repressed in macrophages activated by IL-15. We expect it to have an anti-inflammatory role. 

The Oxford BioMedica clone pi 13 represents ELM02. The protein sequence encoded by ELM02 is 
25 represented in the public databases by the accession A AL14467 and is described in this patent by Seq ID 
153. The nucleotide sequence is represented in the public sequence databases by the accession 
XMJJ12933 and is described in this patent by Seq ID 154. Hypoxia is an important feature of several 
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 
utility in the design of therapeutic, prognostic and diagnostic products. This gene has been shown recently 
30 to promote phagocytosis and cell shape changes [Gumienny et al 2001, Cell 107:27-41]. These functions 
are typical of the macrophage, and are likely to play a role in macrophage-associated diseases. 

The Oxford BioMedica clone pi II 0 represents an unannotated EST. The protein sequence encoded by 
this EST is not represented in the public databases by a protein accession. The nucleotide sequence is 
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represented in the public sequence databases by the accession AA420992 and is described in this patent 
by Seq ID 156. Hypoxia is an important feature of several diseases, and genes that respond to this 
stimulus are therefore implicated in the pathogenesis and have utility in the design of therapeutic, 
prognostic and diagnostic products. 

5 The Oxford BioMedica clone p I H 1 8 represents Ubiquitin specific protease 7. The protein sequence 
encoded by Ubiquitin specific protease 7 is represented in the public databases by the accession 
NP_00346 1 and is described in this patent by Seq ID 157. The nucleotide sequence is represented in the 
public sequence databases by the accession NM J)03470 and is described in this patent by Seq ID 158. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
10 implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. Hypoxia is frequently found in human tumours where macrophage infiltrates are also found. In 
a series of 5 patients with either ovarian or breast cancer, Ubiquitin specific protease 7 is up-regulated in 
the malignant tissue as compared to adjacent normal tissue in at least one patient. We expect decreased 
activity of the gene product to have an anti-tumour effect. 

15 The Oxford BioMedica clone plH24 represents Nucleolar phosphoprotein Nopp34. The protein sequence 
encoded by Nucleolar phosphoprotein Nopp34 is represented in the public databases by the accession 
NP_ 1 1 5766 and is described in this patent by Seq ID 159. The nucleotide sequence is represented in the 
public sequence databases by the accession NM_032390 and is described in this patent by Seq ID 160. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 

20 implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. 

The Oxford BioMedica clone plE22 represents cDNA FLJI36I8 fis, clone PLACE1010925. The protein 
sequence encoded by cDNA FLJ13618 fis, clone PLACE1010925 is not represented in the public 
databases by a protein accession. The nucleotide sequence is represented in the public sequence databases 

25 by the accession AK023680 and is described in this patent by Seq ID 162. Hypoxia is an important 
feature of several diseases, and genes that respond to this stimulus are therefore implicated in the 
pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products. 
Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory processes. In 
these, macrophages are frequently activated by cytokines, which have been shown to be present at disease 

30 sites, so gene expression responses to both hypoxia and cytokines are especially relevant. The cDNA 
FLJ13618 fis, clone PLACE1010925 is induced in macrophages activated by LPS and gamma interferon. 

The Oxford BioMedica clone p I H2 1 represents Hypothetical protein FLJ13511. The protein sequence 
encoded by Hypothetical protein FLi 13511 is represented in the public databases by the accession 
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NPJ49014 and is described in this patent by Seq ID 163. The nucleotide sequence is represented in the 
public sequence databases by the accession NMJ33025 and is described in this patent by Seq JD 164. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
5 products. There is a prejudice in the art that the response to hypoxia is generic to all cell types. Contrary 
to this, we show that genes are regulated by hypoxia to a greater degree in certain cell types, 
substantiating their utility in designing specific therapeutic products for diseases involving those cell 
types. Monocytes and macrophages have been implicated in the following diseases involving hypoxia: 
rheumatoid arthritis, atherosclerosis, cancer, COPD. Hypothetical protein FLJ13511 is preferentially 
10 induced by hypoxia in monocytes or macrophages and a restricted number of other cell types. Hypoxia is 
frequently found in human tumours where macrophage infiltrates are also found. In a series of 5 patients 
with either ovarian or breast cancer, Hypothetical protein FLJ13511 is down-regulated in the malignant 
tissue as compared to adjacent norma) tissue in at least one patient. 

The Oxford BioMedica clone pill represents Ribosomal RNA intergenic spacer. The protein sequence 
15 encoded by Ribosomal RNA intergenic spacer is not represented in the public databases by a protein 
accession. The nucleotide sequence is represented in the public sequence databases by the accession 
AA664228 and is described in this patent by Seq ID 166. Hypoxia is an important feature of several 
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 
utility in the design of therapeutic, prognostic and diagnostic products. 

20 The Oxford BioMedica clone plH 14 represents an unannotated EST. The protein sequence encoded by 
this EST is not represented in the public databases by a protein accession. The nucleotide sequence is 
represented in the public sequence databases by the accession R44397 and is described in this patent by 
Seq ID 168. Hypoxia is an important feature of several diseases, and genes that respond to this stimulus 
are therefore implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and 

25 diagnostic products. 

The Oxford BioMedica clone plHll represents Carboxypeptidase M. The protein sequence encoded by 
Carboxypeptidase M is represented in the public databases by the accession NP_001865 and is described 
in this patent by Seq ID 169. The nucleotide sequence is represented in the public sequence databases by 
the accession NM_001874 and is described in this patent by Seq ID 170. Hypoxia is an important feature 
30 of several diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis 
and have utility in the design of therapeutic, prognostic and diagnostic products. 

The Oxford BioMedica clone pi H 17 represents an unannotated EST. The protein sequence encoded by 
this EST is not represented in the public databases by a protein accession. The nucleotide sequence is 
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represented in the public sequence databases by the accession W 87747 and is described in this patent by 
Seq ID 172. Hypoxia is an important feature of several diseases, and genes that respond to this stimulus 
are therefore implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and 
diagnostic products. Hypoxia is frequently found in human tumours where macrophage infiltrates are also 
5 found. In a series of 5 patients with either ovarian or breast cancer, the EST represented by- Seq ID 172 is 
up-regulated and also down-regulated in the malignant tissue as compared to adjacent normal tissue in at 
least one patient. 

The Oxford BioMedica clone pi H 1 2 represents an unannotated EST. The protein sequence encoded by 
■ this EST is not represented in the public databases by a protein accession. The nucleotide sequence is 
10 represented in the public sequence databases by the accession AA973568 and is described in this patent 
by Seq ID 174. Hypoxia is an important feature of several diseases, and genes that respond to this 
stimulus 'are therefore implicated in the pathogenesis and have utility in the design of therapeutic, 
prognostic and diagnostic products. 

The Oxford BioMedica clone plH7 represents an unannotated EST. The protein sequence encoded by this 
15 EST is not represented irr the public databases by a protein accession. The nucleotide sequence is 
represented in the public sequence databases by the accession T98529 and is described in this patent by 
Seq ID 176. Hypoxia is an important feature of several diseases, and genes that respond to this stimulus 
are therefore implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and 
diagnostic products. 

20 The Oxford BioMedica clone pi H 1 5 represents an unannotated EST. The protein sequence encoded by 
this EST is not represented in the public databases by a protein accession. The nucleotide sequence is 
represented in the public sequence databases by the accession AA022679 and is described in this patent 
by Seq ID 178. Hypoxia is an important feature of several diseases, and genes that respond to this 
stimulus are therefore implicated in the pathogenesis and have utility in the design of therapeutic, 

25 prognostic and diagnostic products. Hypoxia is frequently found in human tumours where macrophage 
infiltrates are also found. In a series of 5 patients with either ovarian or breast cancer, the EST represented 
by Seq ID 178 is up-regulated and also down-regulated in the malignant tissue as compared to adjacent 
normal" tissue in at least one patient. 

The Oxford BioMedica clone plH20 represents an unannotated EST. The protein sequence encoded by 
30 EST is not represented in the public databases by a protein accession. The nucleotide sequence is 
represented in the public sequence databases by the accession HI7921 and is described in this patent by 
Seq ID 180. Hypoxia is an important feature of several diseases, and genes that respond to this stimulus 
are therefore implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and 
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diagnostic products. Hypoxia is frequently found in human tumours where macrophage infiltrates are also 
found. Jn a series of 5 patients with either ovarian or breast cancer, the EST represented by Seq ID 1 80 is 
up-regulated in the malignant tissue as. compared to adjacent normal tissue in at least one patient. We 
expect decreased activity of the gene product to have an anti-tumour effect: 

5 The Oxford BioMedica clone plH8 represents ABL. The protein sequence encoded by ABL is 
represented in the public databases by the accession NP_009297 and is described in this patent by Seq ID 
181. The nucleotide sequence is represented in the public sequence databases by the accession 
NMJ)07313 and is described in this patent by Seq ID 182. Hypoxia is an important feature of several 
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 

10 utility in the design of therapeutic, prognostic and diagnostic products. ABL is induced in macrophages 
treated with the inhibitory cytokine IL-IO. ABL is repressed in macrophages activated by IL-17 and is 
also repressed in macrophages activated by IL-15. We expect it to have an anti-inflammatory role 
Hypoxia is frequently found in human tumours where macrophage infiltrates are also found. In a series of 
5 patients with either ovarian or breast cancer, ABL is up-regulated in the malignant tissue as compared 

15 to adjacent normal tissue in at least one patient. 

The Oxford BioMedica clone plH16 represents an unannotated EST. The protein sequence encoded by 
this EST is not represented in the public databases by a protein accession. The nucleotide sequence is 
represented in the public sequence databases by the accession W 91 958 and is described in this patent by 
Seq ID 184. Hypoxia is an important feature of several diseases, and genes that respond to this stimulus 
20 are therefore implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and 
. diagnostic products. Hypoxia is frequently found in human tumours where macrophage infiltrates are also 
found. In a series of 5 patients with either ovarian or breast cancer, the EST represented by Seq ID 184 is 
up-regulated in the malignant tissue as compared to adjacent normal tissue in at least one patient. 

The Oxford BioMedica clone pi H9 represents an unannotated EST. The protein sequence encoded by this 
25 EST is not represented in the public databases by a protein accession. The nucleotide sequence is 
represented in the public sequence databases by the accession R63694 and is described in this patent by 
Seq ID 186. Hypoxia is an important feature of several diseases, and genes that respond to this stimulus 
are therefore implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and 
diagnostic products. 

30 The Oxford BioMedica clone plH23 represents Hypothetical protein FLJ21094. The protein sequence 
encoded by Hypothetical protein FLJ2I094 is represented in the public databases by the accession 
AAHI4003 and is described in this patent by Seq ID 187. The nucleotide sequence is represented in the 
public sequence databases by the accession AK024747 and is described in this patent by Seq ID 188. 
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Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. 

The Oxford BioMedica clone plHIO represents an unannotated EST. The protein sequence encoded by 
5 this EST is not represented in the public databases by a protein accession. The nucleotide sequence is 
represented in the public sequence databases by the accession AA909912 and is described in this patent 
by Seq ID 190. Hypoxia is an important feature of several diseases, and genes that respond to this 
stimulus are therefore implicated in the pathogenesis and have utility in the design of therapeutic, 
prognostic and diagnostic products. 

10 The Oxford BioMedica clone plH6 represents ah unannotated EST. The protein sequence encoded by this 
EST is not represented in the public databases by a protein accession. The nucleotide sequence is 
represented in the public sequence databases by the accession T99032 and is described in this patent by 
Seq ID 192. Hypoxia is an important feature of several diseases, and genes that respond to this stimulus 
are therefore implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and 

15 diagnostic products. The EST represented by Seq ID 192 is induced in macrophages treated with the 
inhibitory cytokine IL-10. The EST represented by Seq ID 192 is repressed in macrophages activated by 
IL-15. We expect it to have an anti-inflammatory role. 

The Oxford BioMedica clone p 1 H 1 3 represents an unannotated EST. The protein sequence encoded by 
this EST is not represented in the public databases by a protein accession. The nucleotide sequence is 

20 represented in the public sequence databases by the accession H52503 and is described in this patent by 
Seq ID 194. Hypoxia is an important feature of several diseases, and genes that respond to this stimulus 
are therefore implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and 
diagnostic products. Macrophages are key to several diseases involving hypoxia, and contribute to 
inflammatory processes. In these, macrophages are frequently activated by cytokines, which have been 

25 shown to be present at disease sites; so gene expression responses to both hypoxia and cytokines are 
especially relevant. The EST represented by Seq ID 194 is repressed in macrophages activated by LPS 
and gamma interferon. . 

The Oxford BioMedica clone p I H 1 9 represents an unannotated EST. The protein sequence encoded by 
this EST is not represented in the public databases by a protein accession. The nucleotide sequence is 
30 represented in the public sequence databases by the accession AA 127017 and is described in this patent 
by Seq ID 196. Hypoxia is an important feature of several diseases, and genes that respond to this 
stimulus are therefore implicated in the pathogenesis and have utility in the design of therapeutic, 
prognostic. and diagnostic products. Hypoxia is frequently found in human tumours where macrophage 
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infiltrates are also found. In a series of 5 patients with either ovarian or breast cancer, the EST represented 
by the Seq ID 196 is down-regulated in the malignant tissue. as compared to adjacent normal tissue in at 
least one patient. 

The Oxford BioMedica clone plG22 represents an unannotated EST. The protein sequence encoded by 
5 this EST is not represented in the public databases by a protein accession. The nucleotide sequence is 
represented in the public sequence databases by the accession R38647 and is described in this patent by 
Seq ID 198. Hypoxia is an important feature of several diseases, and genes that respond to this stimulus 
are therefore implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and 
diagnostic products. Endothelial cells are key to angiogenesis, a process implicated in several diseases 
10 associated with hypoxia, including cancer and rheumatoid arthritis. The EST represented by Seq ID 198 is 
preferentially induced by hypoxia in endothelial cells. We expect this gene product to have a pro- 
angiogenic effect, and its inhibition to have an anti-angiogenic effect. • 

The Oxford BioMedica clone plG21 represents an unannotated EST. The protein sequence encoded by 
this EST is not represented in the public databases by a protein accession. The nucleotide sequence is 
15 represented in the public sequence databases by the accession T87233 and is described in this patent by 
Seq ID 200. Hypoxia is an important-feature of several diseases, and genes that respond to this stimulus 
are therefore implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and 
diagnostic products. 

The Oxford BioMedica clone plHl represents Hypothetical protein FLJ10826. The protein sequence 
20 encoded by Hypothetical protein FLJ10826 is represented in the public databases by the accession 
BAB 14226 and is described in this patent by Seq ID 201. The nucleotide sequence is represented in the 
public sequence databases by the accession NMJ18233 and is described in this patent by Seq ID 202. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility, in the design of therapeutic, prognostic and diagnostic 
25 products. 

The Oxford BioMedica clone plG20 represents cDNA YO23H03. The protein sequence encoded by 
cDNA YO23H03 is not represented in the public databases by a protein accession. The nucleotide 
sequence is represented in the public sequence databases by the accession AF075053 and is described in 
this patent by Seq ID 204. Hypoxia is an important feature of several diseases, and genes that respond to 
30 this stimulus are therefore implicated in the pathogenesis and have utility in the design of therapeutic, 
prognostic and diagnostic products. Macrophages are key to several diseases involving hypoxia, and 
contribute to inflammatory processes. In these, macrophages are frequently activated by cytokines, which 
have been shown to be present at disease sites, so gene expression responses to both hypoxia and 
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cytokines are especially relevant. The cDNA YO23H03 is repressed in macrophages activated by LPS 
and gamma interferon. 

The Oxford BioMedica clone pi H 5 represents Hypothetical protein FLJ22690. The protein sequence 
encoded by Hypothetical protein FLJ22690 is represented in the public databases by the accession 
5 NP_078987 and is described in* Ih is patent by Seq ID 205. The nucleotide sequence is represented in the 
public sequence databases by the accession NMJJ247I1 and is described in this patent by Seq ID 206. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. Endothelial cells are key to angiogenesis, a process implicated in several diseases associated 

10 with hypoxia, including cancer and rheumatoid arthritis. Hypothetical protein FLJ22690 is preferentially 
induced by hypoxia in endothelial cells. We expect this gene product to have a pro-angiogenic effect, and 
its inhibition to have an anti-angiogenic effect. Macrophages are key to several diseases involving 
hypoxia, and contribute to inflammatory processes. In these, macrophages are frequently activated by 
cytokines, which have been shown to be present at disease sites, so gene expression responses to both 

15 hypoxia and cytokines are especially relevant. Hypothetical protein FLJ22690 is induced in macrophages 
activated by 1L-15. 

The Oxford BioMedica clone pi G 1 9 represents Mitochondrion sequence. The protein sequence encoded 
by Mitochondrion sequence is represented in the public databases by the accession AAH05845 and is 
described in this patent by Seq ID 207. The nucleotide sequence is represented in the public sequence 

20 databases by the accession BC005845 and is described in this patent by Seq ID 208. Hypoxia is an 
important feature of several diseases, and genes that respond to this stimulus are therefore implicated in 
the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products. 
Hypoxia is frequently found in human tumours where macrophage infiltrates are also found. In a series of 
5 patients with either ovarian or breast cancer, the Mitochondrion sequence represented by Seq ID 208 is 

25 down-regulated in the malignant tissue as compared to adjacent normal tissue in at least one patient. 

The Oxford BioMedica clone plH2 represents Fatty acid binding protein 5. The protein sequence 
encoded by Fatty acid binding protein 5 is represented in the public databases by the accession 
NP_001435 and is described in this patent by Seq ID 209. The nucleotide sequence is represented in the 
public sequence databases by the accession NMJ01444 and is described in this patent by Seq ID 210. 
30 Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. There is a prejudice in the. art that the response to hypoxia is generic to all cell types. Contrary 
to this, we show that genes are regulated by hypoxia to a greater degree in certain cell types, 
substantiating their utility in designing specific therapeutic products for diseases involving those cell 
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type's. Monocytes and macrophages have been implicated in the following diseases involving hypoxia: 
rheumatoid arthritis, atherosclerosis, cancer, COPD. Fatty acid binding protein 5 is preferentially induced 
by hypoxia in monocytes or macrophages. Crucially and very recently, Fatty acid binding protein 5 
expressed in macrophages has been shown to play a very important role in the development of 
5 atherosclerotic plaques [Layne et al 2001, FASEB J 15:2733-5], Our demonstration of hypoxic-regulation 
of this gene not only makes clear how this gene can participate in disease initiation and progression, but 
provides for a potential route to diagnosis and therapy of atherosclerosis. Macrophages are key to several 
diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are 
frequently activated by cytokines, which have been shown to be present at disease sites, so gene 

10 expression responses to both hypoxia and cytokines are especially relevant. TNFalpha is an inflammatory 
cytokine, which acts on macrophages, and has been shown to be central to the pathophysiology and 
treatment of diseases including rheumatoid arthritis. Genes that change in expression in response to 
TNFalpha therefore have utility in the design of therapeutic, prognostic and diagnostic products for such 
inflammatory conditions. Fatty acid binding protein 5 is repressed in macrophages activated by 

15 TNFalpha. 

The Oxford BioMedica clone pi G 1 8 represents Mitochondrion sequence. The protein sequence encoded 
by Mitochondrion sequence is not represented in the public databases by a protein accession. The 
nucleotide sequence is represented in the public sequence databases by the accession BC00I612 and is 
described in this patent by Seq ID 212. Hypoxia is an important feature of several diseases, and genes that 

20 respond to this stimulus are therefore implicated in the pathogenesis and have utility in the design of 
therapeutic, prognostic and diagnostic products. Macrophages are key to several diseases involving 
hypoxia, and contribute to inflammatory processes. In "these, macrophages are frequently activated by 
cytokines, which have been shown to be present at disease sites, so gene expression responses to both 
hypoxia and cytokines are especially relevant. The Mitochondrion sequence represented by Seq ID 212 is 

25 repressed in macrophages activated by LPS and gamma interferon. 

The Oxford BioMedica clone plH4 represents an unannotated EST. The protein sequence encoded by this 
EST is not represented in the public databases by a protein accession. The nucleotide sequence is 
represented in the public sequence databases by the accession AA679939 and is described in this patent 
by Seq ID 214. Hypoxia is an important feature of several diseases, and genes that respond to this 
30 stimulus are therefore implicated in the pathogenesis and have utility in the design of therapeutic, 
prognostic and diagnostic products. Macrophages are key to several diseases involving hypoxia, and 
contribute to inflammatory processes. In these, macrophages are frequently activated by cytokines, which 
have been shown to be present at disease sites, so gene expression responses to both hypoxia and 
cytokines are especially relevant. The EST represented by Seq ID 214 is repressed in macrophages 
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activated by IL-17. We expect it to have an anti-inflammatory role. Hypoxia is frequently found in human 
tumours where macrophage infiltrates are also found. In a series of 5 patients with either ovarian or breast 
cancer, the EST represented by the Seq ID 214 is down-regulated in the malignant tissue as compared to 
adjacent normal tissue in at least one patient. 

5 The Oxford BioMedica clone pIH3 represents an unannotated EST. The protein sequence encoded by this 
EST is not represented in the public databases by a protein" accession. The nucleotide sequence is 
represented in the public sequence databases by the accession AA630167 and is described in this patent 
by Seq ID 216. Hypoxia is an important feature of several diseases, and genes that respond to this 
stimulus are therefore implicated in the pathogenesis and have utility in the design of therapeutic, 
10 prognostic and diagnostic products. Hypoxia is frequently found in human tumours where macrophage 
infiltrates are also found. In a series of 5 patients with either ovarian or breast cancer, the EST represented 
by Seq ID 216 is up-regulated and also down-regulated in the malignant tissue as compared to adjacent 
normal tissue in at least one patient. 

The protein sequence encoded by BCL2/adenoviros E1B 19kD-interacting protein 3-like is represented in 
15 the public databases by the accession NP_004322 and is described in this patent by Seq ID 217. The 
nucleotide sequence is represented in the public sequence databases by the accession NM J)04331 and is 
described in this patent by Seq ID 218. Hypoxia is an important feature of several diseases, and genes that 
respond to this stimulus are therefore implicated in the pathogenesis and have utility in the design of 
therapeutic, prognostic and diagnostic products. 

20 The protein sequence encoded by SLC2A1 is represented in the public databases by the accession 
NP_006507 and is described in this patent by Seq ID 219. The nucleotide sequence is represented in the 
public sequence databases by the accession NMJ06516 and is described in this patent by Seq ID 220. 
SLC2A1 is a glucose transporter gene and is also known as GLUTI. Hypoxia is an important feature of 
several diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and 

25 have utility in the design of therapeutic, prognostic and diagnostic products. 

The Oxford BioMedica clone plP3 represents PDGFB. The protein sequence encoded by PDGFB is 
represented in the public databases by the accession NPJ48937 and is described in this patent by Seq ID 
221. The nucleotide sequence is represented in the public sequence databases by the accession 
NM J)330I6 and is described in this patent by Seq ID 222. Hypoxia is an important feature of several 
30 diseases, and genes that respond to "this stimulus are therefore implicated in the pathogenesis and have 
utility in the design of therapeutic, prognostic and diagnostic products. Macrophages are key to several 
diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are 
frequently activated by cytokines, which have been shown to be present at disease sites, so gene 
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expression responses to both hypoxia and cytokines are especially relevant. PDGFB is induced in 
macrophages activated by LPS and gamma interferon. 

The Oxford BioMedica clones plA8 and plA9 represent Lactate dehydrogenase A. The protein sequence 
encoded by Lactate dehydrogenase A is represented in the public databases by the accession NP_005557 
5 and is described in this patent by Seq ID 223. The nucleotide sequence is represented in the public 
sequence databases by the accession NM_005566 and is described in this patent by Seq ID 224. Hypoxia 
is an important feature of several diseases, and genes that respond to this stimulus are therefore implicated 
in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products. 
Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory processes. In 

10 these, macrophages are frequently activated by cytokines, which have been shown to be present at disease 
sites, so gene expression responses to both hypoxia and cytokines are especially relevant. Lactate 
dehydrogenase A is repressed in macrophages activated by LPS and gamma interferon and is also 
repressed in macrophages activated by JL-I5. TNFalpha is an inflammatory cytokine, which acts on 
macrophages, and has been shown to be central to the pathophysiology and treatment of diseases 

15 including rheumatoid arthritis. Genes that change in expression in response to TNFalpha therefore have 
utility in the design of therapeutic, prognostic and diagnostic products for such inflammatory conditions. 
Lactate dehydrogenase A is induced in macrophages activated by TNFalpha. Hypoxia is frequently found 
in human tumours where macrophage infiltrates are also found. In a series of 5 patients with either 
ovarian or breast cancer, Lactate dehydrogenase A is up-regulated in the malignant tissue as compared to 

20 adjacent normal tissue in at least one patient. 

The Oxford BioMedica clone plB17 represents Tissue factor. The protein sequence encoded by Tissue 
factor is represented in the public databases by the accession NP_001984 and is described in this patent 
by Seq ID 225. The nucleotide sequence is represented in the public sequence databases by the accession 
NM_001993 and is described in this patent by Seq ID 226. Hypoxia is an important feature of several 

25 diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 
utility in the design of therapeutic, prognostic and diagnostic products. Tissue factor is preferentially 
induced by hypoxia in mammary epithelial cells. Macrophages are key to several diseases involving 
hypoxia, and contribute to inflammatory processes, in these, macrophages are frequently activated by 
cytokines, which have been shown to be present at disease sites, so gene expression responses to both 

30 hypoxia and cytokines are especially relevant. TNFalpha is an inflammatory cytokine, which acts on 
macrophages, and has been shown to be central to the pathophysiology and treatment of diseases 
including. rheumatoid arthritis. Genes that change in expression in response to TNFalpha therefore have 
utility in the design of therapeutic, prognostic and diagnostic products for such inflammatory conditions. 
Tissue factor is induced in macrophages activated by TNFalpha. Tissue factor is the primary initiator of 
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blood coagulation with structural,homology to the cytokine receptor family, and has been implicated in 
various vascular processes including metastasis, angiogenesis, and atherosclerosis. Our demonstration of 
hypoxic regulation leads to a clear undertanding of the possibility of intervention in disease by 
modulation of Tissue factor activity. 

5 The Oxford BioMedica clone plO20 represents VEGF. The protein sequence encoded by VEGF is 
represented in the public databases by the accession NPJ03367 and is described in this patent by Seq ID 
227. The nucleotide sequence is represented in the public sequence databases by the accession 
NM J03376 and is described in this patent by Seq ID 228. Hypoxia is an important feature of several 
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 

10 utility in the design of therapeutic, prognostic and diagnostic products. Hypoxia is frequently found in 
human tumours where macrophage infiltrates are also found. In a series of 5 patients with either ovarian 
or breast cancer, VEGF is up-regulated in the malignant tissue as compared to adjacent normal tissue in at 
least one patient. 

The Oxford BioMedica clone plB2 represents N-myc downstream regulated. The protein sequence 
15 encoded by N-myc downstream regulated is represented in the public databases by the accession 
NP_006087 and is described in this patent by Seq ID 229. The nucleotide sequence is represented in the 
public sequence databases by the accession NM_006096 and is described in this patent by Seq ID 230. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
20 products. N-myc downstream regulated is preferentially induced by hypoxia in mammary epithelial cells. 

• The Oxford BioMedica clone plB3 represents Proline 4-hydroxylase, alpha polypeptide I. The protein 
sequence encoded by Proline 4-hydroxylase, alpha polypeptide I is represented in the public databases by 
the accession NPJ00908 and is described in this patent by Seq ID 231. The nucleotide sequence is 
represented in the public sequence databases by the accession NM_000917 and is described in this patent 

25 by Seq ID 232. Hypoxia is an important feature of several diseases, and genes that respond to this 
stimulus are therefore implicated in the pathogenesis and have utility in the design of therapeutic, 
prognostic and diagnostic products. Macrophages are key to several diseases involving hypoxia, and 
contribute to inflammatory processes. In these, macrophages are frequently activated by cytokines, which 
have been shown to be present at disease sites, so gene expression responses to both hypoxia and 

30 cytokines are especially relevant. Proline 4-hydroxylase, alpha polypeptide I is repressed in macrophages 
activated by LPS and gamma interferon. Hypoxia is frequently found in human tumours where 
macrophage infiltrates are also found. In a series of 5 patients with either ovarian or breast cancer, Proline 
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4-hydroxylase, alpha polypeptide I is down-regulated in the malignant tissue as compared to adjacent 
normal tissue in at least one patient. 

The protein sequence encoded by BCL2/adenovirus ElB-interacting protein 3 is represented in the public 
databases by the accession NP_004043 and is described in this patent by Seq ID 233. The nucleotide 
5 sequence is represented in the public sequence databases by the accession NM_OO4052 and is described 
in this patent by Seq ID 234. Hypoxia is an important feature of several diseases, and genes that respond 
to this stimulus are therefore implicated in the pathogenesis and have utility in the design of therapeutic, 
prognostic and diagnostic products. 

The Oxford BioMedica clones p 1 B 1 8 and pi B 1 9 represent Plasminogen activator inhibitor, type 1. The 

10 protein sequence encoded by Plasminogen activator inhibitor, type I is represented in the public databases 
by the accession NP_000593 and is described in this patent by Seq ID 235. The nucleotide sequence is 
represented in the public sequence databases by the accession NM_000602 and is described in this patent 
by Seq ID 236. Hypoxia is an important feature of several diseases, and genes that respond to this 
stimulus are therefore implicated in the pathogenesis and have utility in the design of therapeutic, 

15 prognostic and diagnostic products. Macrophages are key to several diseases involving hypoxia, and 
contribute to inflammatory processes. In these, macrophages are frequently activated by cytokines, which 
have been shown to be present at disease sites, so gene expression responses to both hypoxia and 
cytokines are especially relevant. Plasminogen activator inhibitor, type 1 is induced in macrophages 
activated by LPS and gamma interferon. Plasminogen activator inhibitor, type I is repressed in 

20 macrophages activated by IL-17. TNFalpha is an inflammatory cytokine, which acts on macrophages, and 
has been shown to be central to the pathophysiology and treatment of diseases including rheumatoid 
arthritis. Genes that change in expression in response to TNFalpha therefore have utility in the design of 
. therapeutic, prognostic and diagnostic products for such inflammatory conditions. Plasminogen activator 
inhibitor, type 1 is induced in macrophages activated by TNFalpha. Hypoxia is frequently found in human 

25 tumours where macrophage infiltrates are also found. In a series of 5 patients with either ovarian or breast 
cancer, Plasminogen activator inhibitor, type 1 is down-regulated in the malignant tissue as compared to 
adjacent normal tissue in at least one patient. 

The Oxford BioMedica clone p 1 N 1 7 represents C0X-2. The protein sequence encoded by COX-2 is 
represented in the public databases by the accession NP_000954 and is described in this patent by Seq ID 
30 237. The nucleotide sequence is represented in the public sequence databases by the accession 
NM_O0O963 and is described in this patent by Seq ID 238. Hypoxia is an important feature of several 
. diseases, and genes that respond to this stimulos are therefore implicated in the pathogenesis and have 
utility in the design of therapeutic, prognostic and diagnostic products. COX-2 is preferentially induced 
* by hypoxia in mammary epithelial cells. Macrophages are key to several diseases involving hypoxia, and 
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contribute to inflammatory processes. In these, macrophages are frequently activated by cytokines, which 
have been shown to be present at disease sites, so gene expression responses to both hypoxia and 
cytokines are especially relevant. COX-2 is induced in macrophages activated by LPS and gamma 
interferon. TNFalpha is an inflammatory cytokine, which acts on macrophages, and has been shown to be 
5 central to the pathophysiology and treatment of diseases including rheumatoid arthritis. Genes that change 
in expression in response to TNFalpha therefore have utility in the design of therapeutic, prognostic and 
diagnostic products for such inflammatory conditions. COX-2 is induced in macrophages activated by 
TNFalpha. In view of the known role of COX-2 in prostaglandin synthesis and tumour progression, its 
induction by hypoxia has profound clinical implications, and clear utility in diagnosis and therapy. 
10 Hypoxia is frequently found in human tumours where macrophage infiltrates are also found. 

The Oxford BioMedica clone pi A24 represents Metallothionein IH. The protein sequence encoded by 
Metallothionein 1 H is represented in the public databases by the accession NP_005942 and is described in 
this patent by Seq ID 239. The nucleotide sequence is represented in the public sequence databases by the 
accession NM_005951 and is described in this patent by Seq ID 240. Metallothioneins can act as an 

15 antioxidant and free-radical scavenger and are therefore protective against cell death in hypoxia. Hypoxia 
is an important feature of several diseases, and genes that respond to this stimulus are therefore implicated 
in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products. 
Hepatocytes are the main cell type of the liver and genes which are induced in response to hypoxia in this 
cell type are relevant to development of diagnostics and therapeutics towards liver diseases involving 

20 hypoxia, including cirrhosis. Metallothionein IH is preferentially induced by hypoxia in hepatocytes. 
Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory processes. In 
these, macrophages are frequently activated by cytokines, which have been shown to be present at disease 
sites, so gene expression responses to both hypoxia and cytokines are especially relevant. Metallothionein 
IH is induced in macrophages activated by LPS and gamma interferon. 

25 The protein sequence encoded by Metallothionein 1L is represented in the public databases by the 
accession NP_002441 and is described in this patent by Seq ID 241. The nucleotide sequence is 
represented in the public sequence databases by the accession NM J02450 and is described in this patent 
by Seq ID 242. Metallothioneins can act as an antioxidant and free-radical scavenger and are therefore 
protective against cell death in hypoxia. Hypoxia is an important feature of several diseases, and genes 

30 that respond to this stimulus are therefore implicated in the pathogenesis and have utility in the design of 
therapeutic, prognostic and diagnostic products. 

The Oxford BioMedica clone plBl represents Metallothionein 1G. The protein sequence encoded by 
Metallothionein 1G is represented in the public databases by the accession NP_005941 and is described in 
this patent by Seq ID 243. The nucleotide sequence is represented in the public sequence databases by the 
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accession NM J05950 and is described in this patent by Seq ID 244. Metallothioneins can act as an 
antioxidant and free-radical scavenger and are therefore protective against cell death in hypoxia. Hypoxia 
is an important feature of several diseases, and genes that respond to this stimulus are therefore implicated 
in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products. 
5 H1F1 alpha and EPASI are transcription factors which mediate the response to hypoxia of several genes, 
and have them selves been implicated in specific diseases. By adenoviral over-expression of HIFlalpha 
we show augmentation of the hypoxic induction of certain genes, further confirming their status as 
responsive to hypoxia. Metallothionein 1G has been shown to be induced by hypoxia to a greater degree 
following adenoviral over-expression of HIFlalpha. Hepatocytes are the main cell type of the liver and 

10 genes which are induced in response to hypoxia in this cell type are relevant to development of 
diagnostics and therapeutics towards liver diseases involving hypoxia, including cirrhosis. 
Metallothionein 1G is preferentially induced by hypoxia in hepatocytes. Macrophages are key to several 
diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are 
frequently activated by cytokines, which have been shown to be present at disease sites, so gene 

15 expression responses to both hypoxia and cytokines are especially relevant. Metallothionein 1G is 
induced in macrophages activated by LPS and gamma interferon. Hypoxia is frequently found in human 
tumours where macrophage infiltrates are also found. In a series of 5 patients with either ovarian or breast 
cancer, Metallothionein IG is up-regulated in the malignant tissue as compared to adjacent normal tissue 
in at least one patient. 

20 The protein sequence encoded by Metallothionein IE (functional) is not represented in the public 
databases by a protein accession. The nucleotide sequence is represented in the public sequence databases 
by the accession AA872383 and is described in this patent by Seq ID 246. Metallothioneins can act as an 
antioxidant and free-radical scavenger and are therefore protective against cell death in hypoxia. Hypoxia 
is an important feature of several diseases, and genes that respond to this stimulus are therefore implicated 

25 in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products. 

The Oxford BioMedica clones plAl, p]A2, pi A3 and plA4 represent SLC2A3. The protein sequence 
encoded by SLC2A3 .is represented in the public databases by the accession NP_008862 and is described 
in this patent by Seq ID 247. The nucleotide sequence is represented in the public sequence databases by 
the accession NM_006931 and is described in this patent by Seq ID 248. Hypoxia is an important feature 
30 of several diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis 
and have utility in the design of therapeutic, prognostic and diagnostic products. SLC2A3 is induced in 
. macrophages treated with the inhibitory cytokine IL-IO. TNFalpha is an inflammatory cytokine, which 
acts on macrophages, and has been shown to be central to the pathophysiology and treatment of diseases 
including rheumatoid arthritis. Genes that change in expression in response to TNFalpha therefore have 
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utility in the design of therapeutic, prognostic and diagnostic products for such inflammatory conditions. 
SLC2A3 is induced in macrophages activated by TNFalpha. 

The Oxford BioMedica clones p 1 A 1 5, pi A 1 6, pi A 1 7 and pi A 1 8 represent Hexokinase-2. The protein 
sequence encoded by Hexokinase-2 is represented in the public databases by the accession NP_000180 
5 and is described in this patent by Seq ID 249. The nucleotide sequence is represented in the public 
sequence databases by the accession NM_000189 and is described in this patent by Seq ID 250. Hypoxia 
is an important feature of several diseases, and genes that respond to this stimulus are therefore implicated 
in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products. 
Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory processes. In 
10 these, macrophages are frequently activated by cytokines, which have been shown to be present at disease 
sites, so gene expression responses to both hypoxia and cytokines are especially relevant. Hexokinase-2 is 
repressed in macrophages activated by LPS and gamma interferon. 

The Oxford BioMedica clones pi B 14, pi B 1 5 and p 1 B 1 6 represent Interleukin 8. The protein sequence 
encoded by Interleukin 8 is represented in the public databases by the accession NP_000575 and is 

15 described in this patent by Seq ID 251. The nucleotide sequence is represented in the public sequence 
databases by the accession NM_000584 and is described in this patent by Seq ID 252. Hypoxia is an 
important feature of several diseases, and genes that respond-to this stimulus are therefore implicated in 
the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products. 
Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory processes. In 

20 these, macrophages are frequently activated by cytokines, which have been shown to be present at disease 
sites, so gene expression responses to both hypoxia and cytokines are especially relevant. Interleukin 8 is 
induced in macrophages activated by LPS and gamma interferon and is also induced in macrophages 
activated by IL-17. TNFalpha is an inflammatory cytokine, which acts on macrophages, and has been 
shown to be central to the pathophysiology and treatment of diseases including rheumatoid arthritis. 

25 Genes that change in expression in response to TNFalpha therefore have utility in the design of 
therapeutic, prognostic and diagnostic products for such inflammatory conditions. Interleukin 8 is induced 
in macrophages activated by TNFalpha. Hypoxia is frequently found in human tumours where 
macrophage infiltrates are also found. In a series of 5 patients with either ovarian or breast cancer, 
Interleukin 8 is down-regulated in the malignant tissue as compared to adjacent normal tissue in at least 

30 . one patient. 

The Oxford BioMedica clones plAII and pi A 1 2 represent GAPDH. The protein sequence cncodtd by 
GAPDH is represented in the public databases by the accession NP_002037 and is described in this patent 
by Seq ID 253. The nucleotide sequence is represented in the public sequence databases by the accession 
NM_002046 and is described in this patent by Seq ID 254. Hypoxia is an important feature of several 
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diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 
utility in the design of therapeutic, prognostic and diagnostic products. Macrophages are key to several 
diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are 
frequently activated by cytokines, which have been shown to be present at disease sites, so gene 
5 expression responses to both hypoxia and cytokines are especially relevant. GAPDH is repressed in 
macrophages activated by LPS and gamma interferon and is also induced in macrophages activated by IL- 
17 or IL-15. TNFalpha is an inflammatory cytokine, which acts on macrophages, and has been shown to 
be central to the pathophysiology and treatment of diseases including rheumatoid arthritis. Genes that 
change in expression in response to TNFalpha therefore have utility in the design of therapeutic, 
10 prognostic and diagnostic products for such inflammatory conditions. GAPDH is induced in macrophages 
activated by TNFalpha. Hypoxia is frequently found in human tumours where macrophage infiltrates are 
also found. In a series of 5 patients with either ovarian or breast cancer, GAPDH is up-regulated in the 
malignant tissue as compared to adjacent normal tissue in at least one patient. 

The Oxford BioMedica clone pi A 13 represents Phosphoglycerate kinase 1. The protein sequence 
15 encoded by Phosphoglycerate kinase 1 is represented in the public databases by the accession NP_000282 
and is described in this patent by Seq ID 255. The nucleotide sequence is represented in the public 
sequence databases by the accession NMJ00291 and is described in this patent by Seq ID 256. Hypoxia 
is an important feature of several diseases, and genes that respond to this stimulus are therefore implicated 
in the pathogenesis and have utility in the design of therapeutic,. prognostic and diagnostic products. 
20 Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory processes. In 
these, macrophages are frequently activated by cytokines, which have been shown to be present at disease 
sites, so gene expression responses to both hypoxia and cytokines are especially relevant. 
Phosphoglycerate kinase 1 is repressed in macrophages activated by LPS and gamma interferon. 
TNFalpha is an inflammatory cytokine, which acts on macrophages, and has been shown to be central to 
25 the pathophysiology and treatment of diseases including rheumatoid arthritis. Genes that change in 
expression in response to TNFalpha therefore have utility in the design of therapeutic, prognostic and 
diagnostic products for such inflammatory conditions. Phosphoglycerate kinase 1 is induced in 
macrophages activated by TNFalpha. 

The Oxford BioMedica clone pi A 14 represents Enolase 1. The protein sequence encoded by Enolase 1 is 
30 represented in the public databases by the accession NPJ)01419 and is described in this patent by Seq ID 
257. The nucleotide sequence is represented in the public sequence databases by the accession 
NM_001428 and is described in this patent by Seq ID 258. Hypoxia is an important feature of several 
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 
utility in the design of therapeutic, prognostic and diagnostic products. Macrophages are key to several 
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diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are 
frequently activated by cytokines, which have been shown to be present at disease sites, so gene 
expression responses to both hypoxia and cytokines are especially relevant. Enolase 1 is repressed in 
macrophages activated by LPS and gamma interferon. TNFalpha is an inflammatory cytokine, which acts 
5 on macrophages, and has been shown to be central to the pathophysiology and treatment of diseases 
including rheumatoid arthritis. Genes that change in expression in response to TNFalpha therefore have 
utility in the design of therapeutic, prognostic and diagnostic products for such inflammatory conditions. 
Enolase 1 is induced in macrophages activated by TNFalpha. Hypoxia is frequently found in human 
tumours where macrophage infiltrates are also found. In a series of 5 patients with either ovarian or breast 
10 cancer, Enolase 1 is up-regulated in the malignant tissue as compared to adjacent normal tissue in at least 
one patient. 

The Oxford BioMedica clone pi A 19 represents Aldolase C. The protein sequence encoded by Aldolase C 
is represented in the public databases by the accession NP__005 1 56 and is described in this patent by Seq 
ID 259. The nucleotide sequence is represented in the public sequence databases by the accession 

15 NM_005165 and is described in this patent by Seq ID 260. Hypoxia is an important feature of several 
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 
utility in the design of therapeutic, prognostic and diagnostic products. Macrophages are key to several 
diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are 
frequently activated. by cytokines, which have been shown to be present at disease sites, so gene 

20 expression responses to both hypoxia and cytokines are especially relevant. Aldolase C is induced in 
macrophages activated by IL-15. Aldolase C is repressed in macrophages activated by IL-15. TNFalpha is 
an inflammatory cytokine, which acts on macrophages, and has been shown to be central to the 
pathophysiology and treatment of diseases including rheumatoid arthritis. Genes that change in 
expression in response to TNFalpha therefore have utility in the design of therapeutic, prognostic and 

25 diagnostic products for such inflammatory conditions. Aldolase C is induced in macrophages activated by 
TNFalpha. Hypoxia is frequently found in human tumours where macrophage infiltrates are also found. ■ 
In a series of 5 patients with either ovarian or breast cancer, Aldolase is up-regulated in the malignant 
tissue as compared to adjacent normal tissue in at least one patient. 

The Oxford BioMedica clone plA20 represents Triosephosphate isomerase 1. The protein sequence 
30 encoded by Triosephosphate isomerase 1 is represented in the public databases by the accession 
NP JG0356 and is described in this patent by Seq ID 261. The nucleotide sequence is represented in the 
public sequence databases by the accession NM_000365 and is described in this patent by Seq ID 262. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
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products. Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory 
processes. In these, macrophages are frequently activated by cytokines, which have been shown to be 
present at disease sites, so gene expression responses to both hypoxia and cytokines are especially 
relevant. Triosephosphate isomerase 1 is repressed in macrophages activated by LPS and gamma 
5 interferon and is also repressed in macrophages activated by IL-15. TNFalpha is an. inflammatory 
cytokine, which acts on macrophages, and has been shown to be central to the pathophysiology and 
treatment of diseases including rheumatoid arthritis. Genes that change in expression in response to 
TNFalpha therefore have utility in the design of therapeutic, prognostic and diagnostic products for such 
inflammatory conditions. Triosephosphate isomerase 1 is induced in macrophages activated by TNFalpha. 

10 The Oxford BioMedica clone pi A23 represents Metallothionein 2A. The protein sequence encoded by 
Metallothionein 2A is represented in the public databases by the accession NP_005944 and is described in 
this patent by Seq ID 265. The nucleotide sequence is represented in the public sequence databases by the 
accession NM_005953 and is described in this patent by Seq ID 266. Metallothioneins can act as an 
antioxidant and free-radical scavenger and are therefore protective against cell death in hypoxia. Hypoxia 

15 is an important feature of several diseases, and genes that respond to this stimulus are therefore implicated 
in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products. 
H1F1 alpha and EPA SI are transcription factors which mediate the response to hypoxia of several genes, 
and have them selves been implicated in specific diseases. By adenoviral over-expression of HIF1 alpha 
we show augmentation of the hypoxic induction of certain genes, further confirming their status as 

20 responsive to hypoxia. Metallothionein 2A has been shown to be induced by hypoxia to a greater degree 
following adenoviral over-expression of HIF1 alpha. Hepatocytes are the main cell type of the liver and 
genes which are induced in response to hypoxia in this cell type are relevant to development of 
diagnostics and therapeutics towards liver diseases involving hypoxia, including cirrhosis. 
Metallothionein 2 A is preferentially induced by hypoxia in hepatocytes. Macrophages are key to several 

25 diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are 
frequently activated by cytokines, which have been shown to be present at disease sites, so gene 
expression responses to both hypoxia and cytokines are especially relevant. Metallothionein 2 A is 
induced in macrophages activated by LPS and gamma interferon and also induced in macrophages 
activated by IL-15. Hypoxia is frequently found in human tumours where macrophage infiltrates are also 

30 found. In a series of 5 patients with either ovarian or breast cancer, Metallothionein 2A is down-regulated 
in the malignant tissue as compared to adjacent normal tissue in at least one patient. 

The Oxford BioMedica clones pi B20 and plB21 represent Osteopontin. The protein sequence encoded 
by Osteopontin is represented in the public databases by the accession NP_000573 and is described in this 
patent by Seq ID 267. The nucleotide sequence is represented in the public sequence databases by the 
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accession NM_000582 and is described in this patent by Seq ID 268. Hypoxia is an important feature of 
several diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and 
have utility in the design of therapeutic, prognostic and diagnostic products. There is a prejudice in the art 
that the response to hypoxia is generic to all cell types. Contrary to this, we show that genes are regulated 
5 by hypoxia to a greater degree in certain cell types, substantiating their utility in designing specific 
therapeutic products for diseases involving those cell types. Monocytes and macrophages have been 
implicated in the following diseases involving hypoxia: rheumatoid arthritis, atherosclerosis, cancer, 
COPD. Osteopontin is preferentially induced by hypoxia in monocytes or macrophages and a restricted 
number of other cell types. Osteopontin has been shown recently to play a role in autoimmune disease 

10 [Chabas et al, 2001, Science 294: 1731-5], We present a new association between the hypoxic response 
and autoimmune disease. Macrophages are key to several diseases involving hypoxia, and contribute to 
inflammatory processes. In these, .macrophages are frequently activated by cytokines, which have been 
shown to be present at disease sites, so gene expression responses to both hypoxia and cytokines are 
especially relevant. Osteopontin is repressed in macrophages activated by LPS and gamma interferon. 

15 Hypoxia is frequently found in human tumours where macrophage infiltrates are also found. In a series of 
5 patients with either ovarian or breast cancer, Osteopontin is up-regulated in the malignant tissue as 
compared to adjacent normal tissue in al least one patient. 

The Oxford BioMedica clones pi C 1 7 and p 1 C 1 8 represent Granulin. The protein sequence encoded by 
Granulin is represented in the public databases by the accession NP_002078 and is described in this 

20 patent by Seq ID 269. The nucleotide sequence is represented in the public sequence databases by the 
accession NM_002087 and is described in this patent by Seq ID 270. Hypoxia is an important feature of 
several diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and 
have utility in the design of therapeutic, prognostic and diagnostic products. Macrophages are key to 
several diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are 

25 frequently activated by cytokines, which have been shown to be present at disease sites, so gene 
expression responses to both hypoxia and cytokines are especially relevant. Granulin is repressed in 
macrophages activated by LPS and gamma interferon. Hypoxia is frequently found in human tumours 
where macrophage infiltrates are also found. In a series of 5 patients with either ovarian or breast cancer, 
Granulin is up-regulated in the malignant tissue as compared to adjacent normal tissue in at least one 

30 patient. The up-regulation of Granulin, which is a known growth factor, is a clinically significant feature 
of the hypoxic response with clear diagnostic and therapeutic utility. 

The Oxford BioMedica clone plD8 represents Hypoxia-inducible protein 2. The- protein sequence 
encoded by Hypoxia-inducible protein 2 is represented in the public databases by the accession 
NP_037464 and is described in this patent by Seq ID 271. The nucleotide sequence is represented in the 
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public sequence databases by the accession NM_0I3332 and is described in this patent by Seq ID 272. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. Hypoxia-inducible protein 2 is induced in macrophages treated with the inhibitory cytokine IL- 
5 10. Hypoxia-inducible protein 2 is repressed in macrophages activated by IL-17 and is also repressed in 
macrophages activated by IL-1 5. 

The Oxford BioMedica clone plAlO represents Enolase 2. The protein sequence encoded by Enolase 2 is 
represented in the public databases by the accession NP_001966 and is described in this patent by Seq ID 
273. The nucleotide sequence is represented in the public sequence databases by the accession 

10 NM.001975 and is described in this patent by Seq ID 274. Hypoxia is an important feature of several 
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 
utility in the design of therapeutic, prognostic and diagnostic products. Macrophages are key to several 
diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are 
frequently activated by cytokines, which have been shown to be present at disease sites, so gene 

15 expression responses to both hypoxia and cytokines are especially relevant. Enolase 2 is repressed in 
macrophages activated by LPS and gamma interferon. Hypoxia is frequently found in human tumours 
where macrophage infiltrates are also found. In a series of 5 patients with either ovarian or breast cancer, 
Enolase 2 is up-regulated in the malignant tissue as compared to adjacent normal tissue in at least one 
patient. We expect it to have an anti-inflammatory role. 

20 The Oxford BioMedica clone pi G24 represents Glycogen synthase 1. The protein sequence encoded by 
Glycogen synthase 1 is represented in the public databases by the accession NP_002094 and is described 
in this patent by Seq ID 275. The nucleotide sequence is represented in the public sequence databases by 
the accession NM_002103 and is described in this patent by Seq ID 276. Hypoxia is an important feature 
of several diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis 

25 and have utility in the design of therapeutic, prognostic and diagnostic products. Macrophages are key to 
several diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are 
frequently activated by cytokines, which have been shown to be present at disease sites, so gene 
expression responses to both hypoxia and cytokines are especially relevant. Glycogen synthase 1 is 
repressed in macrophages activated by IL-17 and is also repressed in macrophages activated by 1L-15. 

30 The Oxford BioMedica clone plG23 represents ALCAM. The protein sequence encoded by ALCAM is 
represented in the public databases by the accession NPJ016I8-and is described in this patent by Seq ID 
277. The nucleotide sequence is represented in the public sequence databases by the accession 
NM_001627 and is described in this patent by Seq ID 278. Hypoxia is an important feature of several 
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 



WO 02/46465 



PCT/C B0 1/05458 



154 

utility in the design of therapeutic, prognostic and diagnostic products. In view of the recently-discovered 
role of ALCAM in angiogenesis [Ohneda et al, 2001, Blood 2001 Oct I ;98(7):2 1 34-42], our 
demonstration of hypoxic regulation of ALCAM has great clinical significance in the treatment and 
diagnosis of vascular disease and cancer. 

5 TheOxford BioMedica clone pi G7 represents an u'nannotafed EST. The protein sequence encoded by this 
EST is not represented in the public databases by a protein accession. The nucleotide sequence is 
represented in the public sequence databases by the accession BC008022 and is described in this patent 
by Seq ID 282. Hypoxia is an important feature of several diseases, and genes that respond to this 
stimulus are therefore implicated in the pathogenesis and have utility in the design of therapeutic, 

10 prognostic and diagnostic products. Macrophages are key to several diseases involving hypoxia, and 
contribute to inflammatory processes. In these, macrophages are frequently activated by cytokines, which 
have been shown to be present at disease sites, so gene expression responses to both hypoxia and 
cytokines are especially relevant. The EST represented by Seq ID 282 is repressed in macrophages 
activated by LPS and gamma interferon. We expect the product of EST represented by Seq ID 282 to 

15 have an anti-inflammatory role. 

The Oxford BioMedica clone p2A23 represents Chitinase 3-like 2. The protein sequence encoded by 
Chitinase 3-like 2 is represented in the public databases by the accession NP_003991 and is described in 
this patent by Seq ID 283. The nucleotide sequence is represented in the public sequence databases by the 
accession NM_004000 and is described in this patent by Seq ID 284. Hypoxia is an important feature of 
20 several diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and 
have utility in the design of therapeutic, prognostic and diagnostic products. Hypoxia is frequently found 
in human tumours where macrophage infiltrates are also found. In a series of 5 patients with either 
ovarian or breast cancer, Chitinase 3-like 2 is down-regulated in the malignant tissue as compared to 
adjacent normal tissue in at least one patient. 

25 The Oxford BioMedica clone plGl represents BACH1. The protein sequence encoded by BACH1 is 
represented in the public databases by the accession NP_001 177 and is described in this patent by Seq ID 
285. The nucleotide sequence is represented in the public sequence databases by the accession 
NM_00II86 and is described in this patent by Seq ID 286. BACH1, a novel helicase-Iike protein, 
interacts directly with BRCAI and contributes to its DNA repair function. Hypoxia is an important 

30 feature of several diseases, and genes that respond to this stimulus are therefore implicated in the 
pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products. The 
induction by hypoxia of this known transcriptional repressor and potential oncogene [Cantor et al 2001, 
Cell 105:149-60J is a very significant finding with profound implications for the diagnosis and treatment 
of cancer. 
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The Oxford BioMedica clone plG15 represents Phosphoglucomutase I. The protein sequence encoded by 
Phosphoglucomutase I is represented in the public databases by the accession NP_002624 and is 
described in this patent by Seq ID 287. The nucleotide sequence is represented in the public sequence 
databases by the accession NM_002633 and is described in this patent by Seq ID 288. Hypoxia is an 
5 important feature of several diseases, and genes that respond to this stimulus are therefore implicated in 
the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products. 
Phosphoglucomutase 1 is induced in macrophages treated with the inhibitory cytokine IL-IO. 

The Oxford BioMedica clone plF23 represents Hypothetical protein LOC51014. The protein sequence 
encoded by Hypothetical protein LOC51014 is represented in the public databases by the accession 

10 Q9Y3B3 and is described in this patent by Seq ID 289. The nucleotide sequence is represented in the 
public sequence databases by the accession AF151867 and is described in this patent by Seq ID 290. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. Hypoxia is frequently found in human tumours where macrophage infiltrates are also found. In 

15 a series of 5 patients with either ovarian or breast cancer, Hypothetical protein LOC510I4 is down- 
regulated in the malignant tissue as compared to adjacent normal tissue in at least one patient. 

The Oxford BioMedica clone pi G8 represents Sin3-associated polypeptide. The protein sequence 
encoded by Sin3-associated polypeptide is represented in the public databases by the accession 
NPJ03855 and is described in this patent by Seq ID 291. The nucleotide sequence is represented in the 
20 public sequence databases by the accession NM_003864 and is described in this patent by Seq ID 292. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. 

The Oxford BioMedica clone pi G 1 3 represents ABCA1. The protein sequence encoded by ABCAI is 
25 represented in the public databases by the accession NP J05493 and is described in this patent by Seq ID 
293. The nucleotide sequence is represented in the public sequence databases by the accession 
NMJ05502 and is described in this patent by Seq ID 294. Hypoxia is an important feature of several 
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 
utility in the design of therapeutic, prognostic and diagnostic products. Macrophages are key to several 
30 diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are 
frequently activated by cytokines, which have been shown to be present at disease sites, so gene 
expression responses to both hypoxia and cytokines are especially relevant. ABCAI is repressed in 
macrophages activated by LPS and gam ma- interferon. The hypoxia induction of ABCAI, which is known 
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to be relevant to atherosclerosis [Kielar et al 2001 , Clin Chem 47:2089-97], has profound implications for 
the diagnosis and treatment of this disease. 

The Oxford BioMedica clone plGlO represents SEC24 member A. The protein sequence encoded by 
SEC24 member A is represented in the public databases by the accession CAA 10334 and is described in 
5 this patent by Seq ID 295. The nucleotide sequence is represented in the public sequence databases by the 
accession A J 1 3 1 244 and is described in this patent by Seq ID 296. Hypoxia is an important feature of 
several. diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and 
have utility in the design of therapeutic, prognostic and diagnostic products. 

The Oxford BioMedica clone plF24 represents Glia-derived nexin. The protein sequence encoded by 
10 Glia-derived nexin is represented in the public databases by the accession AAA35883 and is described in 
this patent by Seq ID 297. The nucleotide sequence is represented in the public sequence databases by the 
accession M 17783 and is described in this patent by Seq ID 298. Glia-derived nexin is a glycoprotein that 
functions as a serine protease inhibitor with activity towards thrombin, trypsin and urokinase. It is known 
to play a role in neuro-degeneration [Seidel et al 1998, Brain Res Mol Brain Res 60:296-300]. Thus the 
15 hypoxia induction of this gene is highly significant for the diagnosis and treatment of neuro-degenerative 
disease. Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are 
therefore implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and 
diagnostic products. Macrophages are key to several diseases involving hypoxia, and contribute to 
inflammatory processes. In these, macrophages are frequently activated by cytokines, which have been 
20 shown to be present at disease sites, so gene expression responses to both hypoxia and cytokines are 
especially relevant. Glia-derived .nexin is induced in macrophages activated by LPS and gamma 
interferon. Hypoxia is frequently found in human tumours where macrophage infiltrates are also found. In 
a series of 5 patients with either ovarian or breast cancer, Glia-derived nexin is down-regulated in the 
malignant tissue as compared to adjacent normal tissue in at least one patient. 

25 The Oxford BioMedica clone plG2 represents Postsynaptic density-95. The protein sequence encoded by 
Postsynaptic density-95 is represented in the public databases by the accession NP_001356 and is 
described in this patent by Seq ID 299. The nucleotide sequence is represented in the public sequence 
databases by the accession NM_001365 and is described in this patent by Seq ID 300. Postsynaptic 
density-95 belongs to the MAGUK family of cell junction proteins. Hypoxia is an important feature of 

30 several diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and 
have utility in the design of therapeutic, prognostic and diagnostic products. The recent demonstration for 
a possible role for Postsynaptic density-95 in ischaemic pre-conditioning [Tauskela et al 2001, 
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Neuroscience 107:571-584] underlines the medical significance of our determination of the hypoxic 
regulation of this gene, and its utility in the diagnosis and treatment of ischaemic disease. 

The Oxford BioMedica clone plGIl represents Tumour protein D52. The protein sequence encoded by 
Tumour protein D52 is represented in the public databases by the accession NP_005070 and is described 
5 in this patent by Seq ID 301 . The nucleotide sequence is represented in the public sequence databases by 
the accession NM_OO5079 and is described in this patent by Seq ID 302. Hypoxia is an important feature 
of several diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis 
and have utility in the design of therapeutic, prognostic and diagnostic products. The response of renal 
epithelial cells to hypoxia is pertinent to kidney failure, especially regarding the m edullary. tissue. Tumour 
10 protein D52 is preferentially induced by hypoxia in renal epithelial cells. Hypoxia is frequently found in 
human tumours where macrophage infiltrates are also found. In a series of 5 patients with either ovarian 
or breast cancer, Tumour protein D52 is up-regulated in the malignant tissue as compared to adjacent 
normal tissue in at least one patient. Our observation of hypoxia-regulation of this tumour-associated 
protein is highly significant for the diagnosis and treatment of cancer. 

15 The Oxford BioMedica clone p 1 G 1 6 represents p27, Kipl. The protein sequence encoded by p27, Kipl is 
represented in the public databases by the accession NP_004055 and is described in this patent by Seq ID 
303. The nucleotide sequence is represented in the public sequence databases by the accession 
NM_004064 and is described in this patent by Seq ID 304. Hypoxia is an important feature of several 
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 

20 utility in the design of therapeutic, prognostic and diagnostic products. The hypoxia regulation of this 
anti-mitogen has important utility in oncology and angiogenesis [Fouty et al 2001, Am J Respir Cell Mol 
Biol 25:652-658]. 

The Oxford BioMedica clone plG9 represents PI-3-kinase, catalytic, beta polypeptide. The protein 
sequence encoded by PI-3-kinase, catalytic, beta polypeptide is represented in the public databases by the 

25 accession NP_006210 and is described in this patent by Seq ID 305. The nucleotide sequence is 
represented in the public sequence databases by the accession NM_006219 and is described in this patent 
by Seq ID 306. Hypoxia is an important feature of several diseases, and genes that respond to this 
stimulus are therefore implicated in the pathogenesis and have utility in the design of therapeutic, 
prognostic and diagnostic products. Macrophages are key to several diseases involving hypoxia, and 

30 contribute to inflammatory processes. In these, macrophages are frequently activated by cytokines, which 
have been shown to be present at disease sites, so gene expression responses to both hypoxia and 
cytokines are especially relevant. PI-3-kinase, catalytic, beta polypeptide is repressed in macrophages 
activated by LPS and gamnra interferon. The very recent publication of a role for PI3 kinase in 
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angiogenesis induced by hypoxic pre-conditioning [Zhu et al 2001 s FEBS Lett 508:369-74] re-enforces 
the clinical utility which we claim for this gene as a result of its hypoxia-induclion. 

The Oxford BioMedica clone piG4 represents SLC5A3. The protein sequence encoded. by SLC5A3 is 
represented in the public databases by the accession AAC39548 and is described in this patent by Seq ID 
5 307. The nucleotide sequence is represented in the public sequence databases by the accession AF027153 
and is described in this patent by Seq ID 308. Hypoxia is an important feature of several diseases, and 
genes that respond to this stimulus are therefore implicated in the pathogenesis and have utility in the 
design of therapeutic, prognostic and diagnostic products. SLC5A3 is over-expressed in the brains of 
children with Downs Syndrome, and may play a role in brain pathology [Berry et al 1999, J Pediatr 

10 135:94-7]. Thus our claims of clinical utility following from hypoxia induction have great medical 
significance for the diagnosis and treatment of ischaemic and degenerative disease. Macrophages are key 
to several diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages 
are frequently activated by cytokines, which have been shown to be present at disease sites, so gene 
expression responses to both hypoxia and cytokines are especially relevant. SLC5A3 is repressed in 

15 macrophages activated by LPS and gamma interferon. 

The Oxford BioMedica clone p 1 G 1 4 represents Cytohesin binding protein. The protein sequence encoded 
by Cytohesin binding protein is represented in the public databases by the accession NPJ04279 and is 
described in this patent by Seq ID 309. The nucleotide sequence is represented in the public sequence 
databases by the accession NM.004288 and is described in this patent by Seq ID 310. Hypoxia is an 
20 important feature of several diseases, and genes that respond to this stimulus are therefore implicated in 
the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products. 
Cytohesin has been shown to modulate PI3-kinase activity [Dierks et al 2001, J Biol Chem 276:37472- 
81], re-enforcing our claim here and elsewhere in this filing of the relevance to the hypoxic response of 
pathways controlled by the critical second-messenger PI3. 

25 The Oxford BioMedica clones plA5 and plA6 represent SLC2A5. The protein sequence encoded by 
SLC2A5 is represented in the public databases by the accession NP_003030 and is described in this 
patent by Seq ID 311. The nucleotide sequence is represented in the public sequence databases by the 
accession NMJ03039 and is described in this patent by Seq ID 312. Hypoxia is an important feature of 
several diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and 

30 have utility in the design of therapeutic, prognostic and diagnostic products. Macrophages are key to 
several diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are 
frequently activated by cytokines, which have been shown to be present at disease sites, so gene 
expression responses to both hypoxia and cytokines are especially relevant. SLC2A5 is repressed in 
macrophages activated by LPS and gamma interferon. Hypoxia is frequently found in human tumours 
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where macrophage infiltrates are also found. In a series of 5 patients with either ovarian or breast cancer, 
SLC2A5 is up-regulated in the malignant tissue as compared to adjacent normal tissue in at least one 
patient. 

The Oxford BioMedica clones plB6, plB7, plB8 and plB9 represent Adipophilin. The protein sequence 
5 encoded by Adipophilin is represented in the public databases by the accession N P_00 ! 1 1 3 and is 
. described in this patent by Seq ID 313. The nucleotide sequence is represented in the public sequence 
databases by the accession NM_001122 and is described in this patent by Seq ID 314. Hypoxia is an 
important feature of several diseases, and genes that respond to this stimulus are therefore implicated in 
the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products. There is 

10 a prejudice in the art that the response to hypoxia is generic to all cell types. Contrary to this, we show 
that genes are regulated by hypoxia to a greater degree in certain cell types, substantiating their utility in 
designing specific* therapeutic products for diseases involving those cell types. Monocytes and 
macrophages have been implicated in the following diseases involving hypoxia: rheumatoid arthritis, 
atherosclerosis, cancer, COPD. The hypoxia induction of adipophilin lias profound implications for the 

15 causation, diagnosis and treatment of atherosclerosis, because this protein plays a key role in the uptake 
of lipid and foam cell formation [Buechler et al 2001, Biochim Biophys Acta 1532:97-104]. Adipophilin 
is preferentially induced by hypoxia in monocytes or macrophages and a restricted number of other cell 
tyVes. Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory 
processes. In these, macrophages are frequently activated by cytokines, which have been shown to be 

20 present at disease sites, so gene expression responses to both hypoxia and cytokines are especially 
relevant. Adipophilin is repressed in macrophages activated by LPS and gamma' interferon and is also 
repressed in macrophages activated by IL-15. TNFalpha is an inflammatory cytokine, which acts on 
macrophages, and has been shown to be central to the pathophysiology and treatment of diseases 
including rheumatoid arthritis. Genes that change in expression in response to TNFalpha therefore have 

25 utility in the design of therapeutic, prognostic and diagnostic products for such inflammatory conditions. 
Adipophilin is induced in macrophages activated by TNFalpha. Hypoxia is frequently found in human 
tumours where macrophage infiltrates are also found. In a series of 5 patients with either ovarian or breast 
cancer, Adipophilin is up-regulated and also down-regulated in the malignant tissue as compared to 
adjacent normal tissue in at least one patient. 

30 The Oxford BioMedica clone pi G 1 7 represents Early development regulator 2. The protein sequence 
encoded by Early development regulator 2 is represented in the public databases by the accession 
NP_0044I8 and is described in this patent by Seq ID 315. The nucleotide sequence is represented in the 
public sequence databases by the accession NM J)04427 and is described in this patent by Seq ID 316. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
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implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory 
processes. In these, macrophages are frequently activated by cytokines, which have been shown to be 
present at disease sites, so gene expression responses to both hypoxia and cytokines are especially 
5 relevant. Early development regulator 2 is repressed in macrophages activated by LPS and gamma 
interferon. 

The Oxford BioMedica clone plG3 represents B-cell translocation gene I. The protein sequence encoded 
by B-cell translocation gene 1 is represented in the public databases by the accession NP_001722 and is 
described in this patent by Seq ID 317. The nucleotide sequence is represented in the public sequence 

10 databases by the accession NM_001731 and is described in this patent by Seq ID 318. Hypoxia is an 
. important feature of several diseases, and genes that respond to this stimulus are therefore implicated in 
the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products. 
Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory processes. In 
these, macrophages are frequently activated by cytokines, which have been shown to be present at disease 

15 sites, so gene expression responses to both hypoxia and cytokines are especially relevant. B-cell 
translocation gene 1 is induced in macrophages activated by LPS and gamma interferon. Hypoxia is 
frequently found in human tumours where macrophage infiltrates are also found. In a series of 5 patients 
with either ovarian or breast cancer, B -cell translocation gene 1 is down-regulated in the malignant tissue 
as compared to adjacent normal tissue in at least one patient. 

20 The Oxford BioMedica clone plF22 represents Sorting nexin 9. The protein sequence encoded by Sorting 
nexin 9 is represented in the public databases by the accession NP_057308 and is described in this patent 
by Seq ID 319. The nucleotide sequence is represented in the public sequence databases by the accession 
NM JM6224 and is described in this patent by Seq ID 320. Hypoxia is an important feature of several 
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 

25 utility in the design of therapeutic, prognostic and diagnostic products. Macrophages are key to several 
diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are 
frequently activated by cytokines, which have been shown to be present at disease- sites, so gene 
expression responses to both hypoxia and cytokines are especially relevant. TNFalpha is an inflammatory 
cytokine, which acts on macrophages, and has been shown to be central to the pathophysiology and 

30 treatment of diseases including rheumatoid arthritis. Genes that change in expression in response to 
TNFalpha therefore have utility in the design of therapeutic, prognostic and diagnostic products for such 
inflammatory conditions. Sorting nexin 9 is induced in macrophages activated by TNFalpha. 

The Oxford BioMedica clone p)G12 represents Cyclin G2. The protein sequence encoded by Cyclin G2 
is represented in the public.databases by the accession NP_004345 and is described in this patent by Seq 
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ID 321. The nucleotide sequence is represented in the public sequence databases by the accession 
NM_004354 and is described in this patent by Seq ID 322. Hypoxia is an important feature of several 
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 
utility in the design of therapeutic, prognostic and diagnostic products. Macrophages are key to several 
5 diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are 
frequently activated by cytokines, which have been shown to be present at disease sites, so gene 
expression responses to both hypoxia and cytokines are especially relevant. Cyclin G2 is repressed in 
macrophages activated by LPS and gamma interferon. 

The Oxford BioMedica clone plFll represents Hypothetical protein LOC51754. The protein sequence 
10 encoded by Hypothetical protein LOC51754 is represented in the public databases by the accession 
XPJ)49657 and is described in this patent by Seq ID 323. The nucleotide sequence is represented in the 
public sequence databases by the accession AL1 37430 and is described in this patent by Seq ID 324. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
15 products. Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory 
processes. In these, macrophages are frequently activated by cytokines, which have been shown to be 
present at disease sites, so gene expression responses to both hypoxia and cytokines are especially 
relevant. Hypothetical protein LOC51754 is repressed in macrophages activated by LPS and gamma 
interferon. Hypoxia is frequently found in human tumours where macrophage infiltrates are also found. In 
20 a series of 5 patients with either ovarian or breast cancer, Hypothetical protein LOC51754 is up-regulated 
in the malignant tissue as compared to adjacent normal tissue in at least one patient. 

The Oxford BioMedica clone p 1 Fl 4 represents Butyrate response- factor 1. The protein sequence encoded 
by Butyrate response factor 1 is represented in the public databases by the accession NP_0049 17 and is 
described in this patent by Seq ID 327. The nucleotide sequence is represented in the public sequence 

25 databases by the accession NM_004926 and is described in this patent by Seq ID 328. Hypoxia is an 
important feature of several diseases, and genes that respond to this stimulus are therefore implicated in 
the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products. 
HIFI alpha and EPA SI are transcription factors which mediate the response to hypoxia of several genes, 
and have them selves been implicated in specific diseases. By adenoviral over-expression of EPAS1 we 

30 show augmentation of the hypoxic induction of certain genes, further confirming their status as 
responsive to hypoxia. Butyrate response factor 1 has been shown to be induced by hypoxia to a greater 
degree following adenoviral over-expression of EPASI . Hypoxia is frequently found in human tumours 
where macrophage infiltrates are also found. In a series of 5 patients with either ovarian or breast cancer, 
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Butyrate response factor 1 is down-regulated in the malignant tissue as compared to adjacent normal 
tissue in at least one patient. 

The Oxford BioMedica clone plF17 represents P8 protein (candidate of metastasis 1). The protein 
sequence encoded by P8 protein (candidate of metastasis 1) is represented in the public databases by the 
5 accession NP_036517 and is described in this patent by Seq ID 329. The nucleotide sequence is 
represented in the public sequence databases by the accession NM_012385 and is described in this patent 
by Seq ID 330. Hypoxia is an important feature of several diseases, and genes that respond to this 
stimulus are therefore implicated in the pathogenesis and have utility in the design of therapeutic, 
prognostic and diagnostic products. Macrophages are key to several diseases involving hypoxia, and 

10 contribute to inflammatory processes. In these, macrophages are frequently activated by cytokines, which 
have been shown to be present at disease sites, so gene expression responses to both hypoxia and- 
cytokines are especially relevant. P8 protein (candidate of metastasis I) is induced in macrophages 
activated by LPS and gamma interferon. Hypoxia is frequently found in human tumours where 
macrophage infiltrates are also found. In a series of 5 patients with either ovarian ot breast cancer, P8 

15 protein (candidate of metastasis 1) is up-regulated and also down-regulated in the malignant tissue as 
compared to adjacent normal tissue in at least one patient. 

The Oxford BioMedica clones pi C 1 and plC2 represent CXCR4. The protein sequence encoded by 
CXCR4 is represented in the public databases by the accession NP_003458 and is described in this patent 

♦ • by Seq ID 331. The nucleotide sequence is represented in the public sequence databases by the accession 

20 NM_003467 and is described in this patent by Seq ID 332. Hypoxia is an important feature of several 
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 
utility in the design of therapeutic, prognostic and diagnostic products. Macrophages are key to several- 
diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are 
frequently activated by cytokines, which have been shown to be present at disease sites, so gene 

25 expression responses to both hypoxia and cytokines are especially relevant. CXCR4 is repressed in 
macrophages activated by LPS and gamma interferon. TNFalpha is an inflammatory cytokine, which acts 
on- macrophages, and has been shown to be central to the pathophysiology and treatment of diseases 
including rheumatoid arthritis. Genes that change in expression in response to TNFalpha therefore have 
utility in the design of therapeutic, prognostic and diagnostic products for such inflammatory conditions. 

30 CXCR4 is induced in macrophages activated by TNFalpha. CXCR4 may act through the PI3-K pathway. 
Hypoxia is frequently found in human tumours where macrophage infiltrates are also found. In a series of 
5 patients with either ovarian or breast cancer, CXCR4 is up-regulated and also down-regulated in the 
malignant tissue as compared to adjacent normal tissue in at least one patient. 



WO 02/46465 



PCT/CBIM/0545X 



163 

The Oxford BioMedica clone plF3 represents Hypothetical protein XP_01713I. The protein sequence 
encoded by Hypothetical protein XPJH7131 is represented in the public databases by the accession 
XP_0I7I31 and is described in this patent by Seq ID 333. The nucleotide sequence is represented in the 
public sequence databases by the accession X M _0 1 7 1 3 J and is described in this patent by Seq ID 334. 
5 Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory 
* processes. In these, macrophages are frequently activated by cytokines, which have been shown to be 
present at disease sites, so gene expression responses to both hypoxia and cytokines are especially 
10 relevant. Hypothetical protein X P_0 17131 is repressed in macrophages activated by LPS and gamma 
interferon. Hypoxia is frequently found in human tumours where macrophage infiltrates are also found. In 
a series of 5. patients with either ovarian or breast cancer, Hypothetical protein XP_01 7131 is down- 
regulated in the malignant tissue as compared to adjacent normal tissue in at least one patient. 

The Oxford BioMedica clone pi F20 represents Proline-rich protein with nuclear targeting signal. The 

15 protein sequence encoded by Proline-rich protein with nuclear targeting signal is represented in the public 
databases by the accession NP_006804 and is described in this patent by Seq ID 335. The nucleotide 
sequence is represented in the public sequence databases by the accession NM_0068I3 and is described 
in. this patent by Seq ID 336. Hypoxia is an important feature of several diseases, and genes that respond 
to this stimulus are therefore implicated in the pathogenesis and have utility in the design of therapeutic, 

20 prognostic and diagnostic products. Macrophages are key to several diseases involving hypoxia, and 
contribute to inflammatory processes. In these, macrophages are frequently activated by cytokines, which 
have been shown to be present at disease sites, so gene expression responses to both hypoxia and 
cytokines are especially relevant. TNFalpha is an inflammatory cytokine, which acts on macrophages, 
and has been shown to be central to the pathophysiology and treatment of diseases including rheumatoid 

25 arthritis. Genes that change in expression in response to TNFalpha therefore have utility in the design of 
therapeutic, prognostic and diagnostic products for such inflammatory conditions. Proline-rich protein 
with nuclear targeting signal is induced in macrophages activated by TNFalpha. Hypoxia is frequently 
found in human tumours where macrophage infiltrates are also found. In a series of 5 patients with either 
ovarian or breast cancer, Proline-rich protein with nuclear targeting signal is down-regulated in the 

30 malignant tissue as compared to adjacent normal tissue in at least one patient. 

The Oxford BioMedica clone plF6 represents Hypothetical protein hqp0376. The protein sequence 
encoded by Hypothetical protein hqp0376 is represented in the public databases by the accession T08745 
and is described in this patent by Seq ID 337. The nucleotide sequence is represented in the public 
sequence databases by the accession AF078844 and is described in this patent by Seq ID 338. 
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Hypothetical protein hqp0376 is a putative dead box protein as it shows high structural similarity to dead 
box proteins and yeast initiation factor 4A. Hypoxia is an important feature of several diseases, and genes 
that respond to this stimulus are therefore implicated in the pathogenesis and have utility in the design of 
therapeutic, prognostic and diagnostic products. H1F1 alpha and EPAS1 are transcription factors which 
5 mediate the response to hypoxia of several genes, and have them selves been implicated in specific 

diseases. By adenoviral over-expression of H IF1 alpha we show augmentation of the hypoxic induction of 
certain genes, further confirming their status as responsive to hypoxia. Hypothetical protein hqp0376 has 
been shown to be induced by hypoxia to a greater degree following adenoviral over-expression of 
HlFlalpha. Hepatocytes are the main cell type of the liver and genes which are induced in response to 

10 hypoxia in this cell type are relevant to development of diagnostics and therapeutics towards liver 

diseases involving hypoxia, including cirrhosis. Hypothetical protein hqp0376 is preferentially induced by 
hypoxia in hepatocytes. Macrophages are key to several diseases involving hypoxia, and contribute to 
inflammatory processes. In these, macrophages are frequently activated by cytokines, which have been 
shown to be present at disease sites, so gene expression responses to both hypoxia and cytokines are 

15 especially relevant. Hypothetical protein hqp0376 is induced in macrophages activated by LPS and 
gamma interferon. 

The Oxford BioMedica clone plF4 represents CYP1. The protein sequence encoded by CYP1 is 
represented in the public databases by the accession NP_000776 and is described in this patent by Seq ID 
339. The nucleotide sequence is represented in the public sequence databases by the accession 

20 NM_000785 and is described in this patent by Seq ID 340. Hypoxia is an important feature of several 
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 
utility in the design of therapeutic, prognostic and diagnostic products. There is a prejudice in the art that 
the response to hypoxia is generic to all cell types. Contrary to this, we show that genes are regulated by 
hypoxia to a greater degree in certain cell types, substantiating their utility in designing specific 

25 therapeutic products for diseases involving those cell types. Monocytes and macrophages have been 
implicated in the following diseases involving hypoxia: rheumatoid arthritis, atherosclerosis, cancer, 
COPD. CYP1 is preferentially induced by hypoxia in monocytes or macrophages. Macrophages are key 
to several diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages 
are frequently activated by cytokines, which have been shown to be present at disease sites, so gene 

30 expression responses to both hypoxia and cytokines are especially relevant. CYPI is induced in 
. macrophages activated by LPS and gamma interferon. TNFalpha is an inflammatory cytokine, which acts 
on macrophages, and has been shown to be central to the pathophysiology and treatment of diseases 
including rheumatoid arthritis. Genes that change in expression in response to TNFalpha therefore have 
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utility in the design of therapeutic, prognostic and diagnostic products for such inflammatory conditions. 
CYP1 is induced in macrophages activated by TNFalpha. 

The Oxford BioMedica clone p 1 F 1 5 represents SHB adaptor protein. The protein sequence encoded by 
SHB adaptor protein is represented in the public databases by the accession NP_003019 and is described 
5 in this patent by Seq ID 34 1 . The nucleotide sequence is represented in the public sequence databases by 
the accession NM_003028 and is described in this patent by Seq ID 342. Hypoxia is an important feature 
of several diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis 
and have utility in the design of therapeutic, prognostic and diagnostic products. SHB adaptor protein 
participates in tyrosine kinase-mediated signalling and the regulation of angiogenesis and apotosis 

10 [Dixelius J. 2000, Blood 95:3403-1 1], Our surprising observation of the hypoxia regulation of this protein 
has clear medical utility in the diagnosis and treatment of vascular disease and cancer. Macrophages are 
key to several diseases involving hypoxia, and contribute to inflammatory processes. In these, 
macrophages are frequently activated by cytokines, which have been shown to be present at disease sites, 
so gene expression responses to both hypoxia and cytokines are especially relevant. SHB adaptor protein 

15 is repressed in macrophages activated by LPS and gamma interferon. . 

The Oxford BioMedica clone plF13 represents Papillomavirus regulatory factor PRF-1. The protein 
sequence encoded by Papillomavirus regulatory factor PRF-1 is represented in the public databases by the 
accession NP_061 1 30 and is described in this patent by Seq ID 343. The nucleotide sequence is 
represented in the public sequence databases by the accession AK023418 and is described in this patent 

20 by Seq ID 344. Hypoxia is an important feature of several diseases, and genes that respond to this 
stimulus are therefore implicated in the pathogenesis and have utility in the design of therapeutic, 
prognostic and diagnostic products. Macrophages are key to several diseases involving hypoxia, and 
contribute to inflammatory processes. In these, macrophages are frequently activated by cytokines, which 
have been shown to be present at disease sites, so gene expression responses to both hypoxia and 

25 cytokines are especially relevant. Papillomavirus regulatory factor PRF-1 is repressed in macrophages 
activated by LPS and gamma interferon. 

The Oxford BioMedica clone pi A7 represents SLC31 A2. The protein sequence encoded by SLC31 A2 is 
represented in the public databases by the accession NP_00 1 85 1 and is described in this patent by Seq ID 
345. The nucleotide sequence is represented in the public sequence databases by the accession 
30 NM_001 860 and is described in this patent by Seq ID 346. Hypoxia is an important feature of several 
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 
utility in the design of therapeutic, prognostic and diagnostic products. Macrophages are key to several 
diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are 
frequently activated by cytokines, which have been shown to be present at disease sites, so gene 
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expression responses to both hypoxia and cytokines are especially relevant. SLC31A2 is induced in 
macrophages activated by LPS and gamma interferon. 

The Oxford BioMedica clone plA21 represents UDP-glucose pyrophosphorylase 2. The protein sequence 
encoded by UDP-glucose pyrophosphorylase 2 is represented in the public databases by the accession 
5 NP_006750 and is described in this patent by Seq ID 347. The nucleotide sequence is represented in the 
public sequence databases by the accession NM_006759 and is described in this patent by Seq ID 348. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. 

10 The Oxford BioMedica clones plB4 and plB5 represent Proline 4-hydroxylase, alpha polypeptide II. The 
protein sequence encoded by Proline 4-hydroxylase, alpha polypeptide II is represented in the public 
databases by the accession NPJ04190 and is described in this patent by Seq ID 349. The nucleotide 
sequence is represented in the public sequence databases' by the accession NM.004199 and is described 
in this patent by Seq ID 350. Hypoxia is an important feature of several diseases, and genes that respond 

15 to this stimulus are therefore implicated in the pathogenesis and have utility in the design of therapeutic, 
prognostic and diagnostic products. Macrophages are key to several diseases involving hypoxia, and 
contribute to inflammatory processes. In these, macrophages are frequently activated by cytokines, which 
have been shown to be present at disease sites, so gene expression responses to both hypoxia and 
cytokines are especially relevant. Proline 4-hydroxyIase, alpha polypeptide II is repressed in macrophages 

20 activated by LPS and gamma interferon and is also repressed in macrophages activated by IL-15. 
TNFalpha is an inflammatory cytokine, which acts on macrophages, and has been shown to be central to 
the pathophysiology and treatment of diseases including rheumatoid arthritis. Genes that change in 
expression in response to TNFalpha therefore have utility in the design of therapeutic, prognostic and 
diagnostic products for such inflammatory conditions. Proline 4-hydroxylase, alpha polypeptide II is 

25 induced in macrophages activated by TNFalpha. Hypoxia is frequently found in human tumours where 
macrophage infiltrates are also found. In a series of 5 patients with either ovarian or breast cancer, Proline 
4-hydroxylase, alpha polypeptide II is up-regulated in the malignant tissue as compared to adjacent 
normal tissue in at least one patient. 

The Oxford BioMedica clones plB 10, plB 1 1 and pi B 12 represent Stearoyl-CoA desaturase. The protein 
30 sequence encoded by Stearoyl-CoA desaturase is represented in the public databases by the accession 
NP_005054 and is described in this patent by Seq ID 351. The nucleotide sequence is represented in the 
public sequence databases by the accession NM J05063 and is described in this patent by Seq ID 352. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
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products. Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory 
processes. In these, macrophages are frequently activated by cytokines, which have been shown to be 
present at disease sites, so gene expression responses to both hypoxia and cytokines are especially 
relevant. TNFalpha is an inflammatory cytokine, which acts on macrophages, and has been shown to be 
5 central to the pathophysiology and treatment of diseases including rheumatoid arthritis. Genes that change 
in expression in response to TNFalpha therefore have utility in the design of therapeutic, prognostic and 
diagnostic products for such inflammatory conditions. Stearoyl-CoA desaturase is induced in 
macrophages activated by TNFalpha. 

The Oxford BioMedica clone p 1 B 1 3 represents Diacylglycerol kinase, zeta. The protein sequence 
10 encoded by Diacylglycerol kinase, zeta is represented in the public databases by the accession 
NPJ)03637 and is described in this patent by Seq ID 353. The nucleotide sequence is represented in the 
public sequence databases by the accession NM_003646 and is described in this patent by Seq ID 354. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
15 products. 

The Oxford BioMedica clone plB22 represents Protease, serine, II. The protein sequence encoded by 
Protease, serine, 1 1 is represented in the public databases by the accession NPJ02766 and is described in 
this patent by Seq ID 355. The nucleotide sequence is represented in the public sequence databases by the 
accession NM.002775 and is described in this patent by Seq ID 356. Hypoxia is an important feature of 

20 several diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and 
have utility in the design of therapeutic, prognostic and diagnostic products. Macrophages are key to 
several diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are 
frequently activated by cytokines, which have been shown to be present at disease sites, so gene 
expression .responses to both hypoxia and cytokines are especially relevant. Protease, serine, II is 

25 repressed in macrophages activated by LPS and gamma interferon. Hypoxia is frequently found in human 
tumours where macrophage infiltrates are also found. In a series of 5 patients with either ovarian or breast 
cancer, Protease, serine, 11 is down-regulated in the malignant tissue as compared to adjacent normal 
tissue in at least one patient. 

The Oxford BioMedica clone pi B23 represents Interleukin 1 receptor antagonist. The protein sequence 
30 encoded by Interleukin 1 receptor antagonist is represented in the public databases by the accession 
NPJ100568 and is described in this patent by Seq ID 357. The nucleotide sequence is represented in the 
public sequence databases by the accession NM_000577 and is described in this patent' by Seq ID 358. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
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products. There is a prejudice in the art that the response to hypoxia is generic to all cell types. Contrary 
to this, we show that genes are regulated by hypoxia to a greater degree in certain cell types, 
substantiating their utility in designing specific therapeutic products for diseases involving those cell 
types. Monocytes and macrophages have been implicated in the following diseases involving hypoxia: 
5 rheumatoid arthritis, atherosclerosis, cancer, COPD. Interleukin 1 receptor antagonist is preferentially 
induced by hypoxia in monocytes or macrophages. TNFalpha is an inflammatory cytokine, which acts on 
macrophages, and has been shown to be central to the pathophysiology and treatment of diseases 
including rheumatoid arthritis. Genes that change in expression in response to TNFalpha therefore have 
utility in the design of therapeutic, prognostic and diagnostic products for such inflammatory conditions. 
10 Interleukin 1 receptor antagonist is induced in macrophages activated by TNFalpha. 

The Oxford BioMedica clone pi B24 represents NS1 -binding protein. The protein sequence encoded by 
NSI -binding protein is represented in the public, databases by the accession NP_006460 and is described 
in this patent by Seq ID 359. The nucleotide sequence is represented in the public sequence databases by 
the accession NM_006469 and is described in this patent by Seq ID 360. Hypoxia is an important feature 
15 of several diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis 
and have utility in the design of therapeutic, prognostic and diagnostic products. 

The Oxford BioMedica clone plC3 represents Activin A receptor, type I. The protein sequence encoded 
by Activin A receptor, type I is represented in the public databases by the accession NP_001 096 and is 
described in this patent by Seq ID 361. The nucleotide sequence is represented in the public sequence 

20 databases by the accession NM_001 105 and is described in this patent by Seq ID 362. Hypoxia is an 
important feature of several diseases, and genes that respond to this stimulus are therefore implicated in 
the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products. Activin 
A is known to induce apoptosis [Hughes et al 1999, Prog Neurobiol 57:421-50], and so the regulation of 
its receptor by hypoxia has clear clinical significance. Hypoxia is frequently found in human tumours 

25 where macrophage infiltrates are also found. In a series of 5 patients with either ovarian or breast cancer, 
Activin A receptor, type I is down-regulated in the malignant tissue as compared to adjacent normal tissue 
in at least one patient. 

The Oxford BioMedica clone plC4 represents FGF receptor activating protein 1. The protein sequence 
encoded by FGF receptor activating protein 1 is represented in the public databases by the accession 
30 NP_055304 and is described in this patent by Seq ID 363. The nucleotide sequence is represented in the 
public sequence databases by the accession NM JH4489 and is described in this patent by Seq ID 364. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. FGF has been shown to enhance survival of cardiac cells after ischaemic insult [Sheikh et al 
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2001, Am J Physiol Heart Circ Physiol 280:H 1 039-50], and so our observation of the hypoxia-regulation 
of the FGF receptor activating protein 1 is highly significant for the diagnosis and treatment of ischaemic 
disease. FGF receptor activating protein I is induced in macrophages treated with the inhibitory cytokine 
IL-I0. Hypoxia is frequently found in human tumours where macrophage infiltrates are also found. In a 
5 series of 5 patients with either ovarian or breast cancer, FGF receptor activating protein 1 is up-regulated 
in the malignant tissue as compared to adjacent normal tissue in at least one patient. 

The Oxford BioMedica clone plC5 represents Galectin 8. The protein sequence encoded by Galectin 8 is 
represented in the public databases by the accession NPJ06490 and is described in this patent by Seq ID 
365. The nucleotide sequence is represented in the public sequence databases by the accession 
10 NM.006499 and is described in this patent by Seq ID 366. Hypoxia is an important feature of several 
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 
utility in the design of therapeutic, prognostic and diagnostic products. Galectin 8 is an important tumour 
marker [for review see Bidon et al 2001, Int J Mol Med 8:245-50], and so its hypoxia-regulation is highly 
significant clinically. 

15 The Oxford BioMedica clone plC6 represents Glucose phosphate isomerase. The protein sequence 
encoded by Glucose phosphate isomerase is represented in the public databases by the accession 
NP_000166 and is described in this patent by Seq ID 367. The nucleotide sequence is represented in the 
public sequence databases by the accession NM_000175 and is described in this patent by Seq ID 368. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 

20 implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products, Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory 
processes. In these, macrophages are frequently activated by cytokines, which have been shown to be 
present at disease sites, so gene expression responses to both hypoxia and cytokines are especially 
relevant. Glucose phosphate isomerase is induced in macrophages activated by IL-17 and also induced in 

25 macrophages activated by IL-15. TNFalpha is an inflammatory cytokine, which acts on macrophages, and 
has been shown to be central to the pathophysiology and treatment of diseases including rheumatoid 
arthritis. Genes that change in expression in response to TNFalpha therefore have utility in the design of 
therapeutic, prognostic and" diagnostic products for such inflammatory conditions. Glucose phosphate 
isomerase is induced in macrophages activated by TNFalpha. Hypoxia is frequently found in human 

30 tumours where macrophage infiltrates are also found. In a series of 5 patients with either ovarian or breast 
cancer, Glucose phosphate isomerase is up-regulated in the malignant tissue as compared to adjacent 
normal tissue in at least one patient. 
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The Oxford BioMedica clone plC7 represents D123. The protein sequence encoded by D123 is 
represented in the public databases by the accession NP_0060I4 and is described in this patent by Seq ID 
369. The nucleotide sequence is represented in the public sequence databases by the accession 
NM J106023 and is described in this patent by Seq ID 370. Hypoxia is an important feature of several 
5 diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 
utility in the design of therapeutic, prognostic and diagnostic products. Macrophages are key to several 
diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are 
frequently activated by cytokines, which have been shown to be present at disease sites, so gene 
expression responses to both hypoxia and cytokines are especially relevant. DI23 is repressed in 

10 macrophages activated by LPS and gamma interferon and is also repressed in macrophages activated by 
IL-15. D123 protein is an important regulator of the cell cycle [Onisto et al 1998, Exp Cell Res 242:451- 
.9], Recently it has been shown to be regulated by modification and turnover [Okuda et al 2001, Cell 
Struct Funct 26:205-14]. We have shown the hypoxia-regulation of this protein, and also of several prolyl 
hydroxylases which are known to. target proteins for ubiquitination and proteasomal degradation. We 

15 believe that concerted hypoxic control of D 123 and its regulating prolyl hydroxylase is part of the means 
of hypoxic regulation of cell growth and tissue re-modelling. 

The Oxford BioMedica clone pIC8 represents DEC-1. The protein sequence encoded by DEC-1 is 
represented in the public databases by the accession NP_003661 and is described in this patent by Seq ID 
371. The nucleotide sequence is represented in the public sequence databases by the accession 

20 NM_003670 and is described in this patent by Seq ID 372. Hypoxia is an important feature of several 
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 
utility in the design of therapeutic, prognostic and diagnostic products. DEC-1 is a helix-loop-helix 
transcription factor, and its hypoxia-regulation is highly significant. The response of renal epithelial cells 
to hypoxia is pertinent to kidney failure, especially regarding the medullary tissue. DEC-1 is 

25 preferentially induced by hypoxia in renal epithelial cells. Hypoxia is frequently found in human tumours 
where macrophage infiltrates are also found. In a series of 5 patients with either ovarian or breast cancer, 
DEC-1 is up-regulated and also down-regulated in the malignant tissue as compared to adjacent normal 
tissue in at least one patient. 

The Oxford BioMedica clone plC9 represents RAB-8b protein. The protein sequence encoded by RAB- 
30 8b protein is represented in the public databases by the accession NPJ)57614 and is described in this 
patent by Seq ID 373. The nucleotide sequence is represented in the public sequence databases by the 
accession NM_016530 and is described in this patent by Seq ID 374. Hypoxia is an important feature of 
several diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and 
have utility in the design of therapeutic, prognostic and diagnostic products. The hypoxia regulation of 
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this small GTP-ase, which is involved in intracellular membrane trafficking, is highly significant. 
Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory processes. In 
these, macrophages are frequently activated by cytokines, which have been shown to be present at disease 
sites, so gene expression responses to both hypoxia and cytokines are especially relevant. RAB-8b protein 
5 is induced in macrophages activated by LPS and gamma interferon. 

The Oxford BioMedica clone pICIO represents Regulator of G-protein signalling I. The protein sequence 
encoded by Regulator of G-protein signalling 1 is represented in the public databases by the accession 
NP_002913 and is described in this patent by Seq ID 375. The nucleotide sequence is represented in the 
public sequence databases by the accession NM_002922 and is described in this patent by Seq ID 376. 

10 Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. There is a prejudice in the art that the response to hypoxia is generic to all cell types. Contrary 
to this, we show that genes are regulated by hypoxia to a greater degree in certain cell types, 
substantiating their utility in designing specific therapeutic products for diseases involving those cell 

15 types. Monocytes and macrophages have been implicated in the following diseases involving hypoxia: 
rheumatoid arthritis, atherosclerosis, cancer, COPD. Regulator of G-protein signalling 1 is preferentially 
induced by hypoxia in monocytes or macrophages. Macrophages are key to several diseases involving 
hypoxia, and contribute to inflammatory processes. In these, macrophages are frequently activated by 
cytokines, which have been shown to be present at disease sites, so gene expression responses to both 

20 hypoxia and cytokines are especially relevant. Regulator of G-protein signalling I is repressed in 
macrophages activated by LPS and gamma interferon. Hypoxia is frequently found in human tumours 
where macrophage infiltrates are also found. In a series of 5 patients with either ovarian or breast cancer, 
Regulator of G-protein signalling 1 is down-regulated in the malignant tissue as compared to adjacent 
normal tissue in at least one patient. 

25 The Oxford BioMedica clone plCll represents Polyubiquitin. The protein sequence encoded by 
Polyubiquitin is represented in the public databases by the accession BAA23632 and is described in this 
patent by Seq ID 377. The nucleotide sequence is represented in the public sequence databases by the 
accession AB009010 and is described in this patent by Seq ID 378. Hypoxia is an important feature of 
several diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and 

30 have utility in the design of therapeutic, prognostic and diagnostic products. Macrophages are key to 
several diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are 
frequently activated by cytokines, which have been shown to be present at disease sites, so gene- 
expression responses to both hypoxia and cytokines are especially relevant. Polyubiquitin is repressed in 
macrophages activated by LPS and gamma interferon. TNFalpha is an inflammatory cytokine, which acts 
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on macrophages, and has been shown to be central to the pathophysiology and treatment of diseases 
including rheumatoid arthritis. Genes that change in expression in response to TNFalpha therefore have 
utility in the design of therapeutic, prognostic and diagnostic products for such inflammatory conditions. 
Polyubiquitin is induced in macrophages activated by TNFalpha. Hypoxia is frequently found in human 
5 tumours where macrophage infiltrates are also found. In a series of 5 patients with either ovarian or breast 
cancer, Polyubiquitin is up-regulated in the malignant tissue as compared to adjacent normal tissue in at 
least one patient. 

The Oxford BioMedica clone plC12 represents Integrin, alpha 5. The protein sequence encoded by 
Integrin, alpha 5 is represented in the public databases by the accession NP_002196 and is described in 

10 this patent by Seq ID 379. The nucleotide sequence is represented in the public sequence databases by the 
accession NM_002205 and is described in this patent by Seq ID 380. Hypoxia is an important feature of 
several diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and 
have utility in the design of therapeutic, prognostic and diagnostic products. Integrin, alpha 5 may play a 
role in the response to neuronal injury [King et al 2001, J Neurocytol 30:243-52]. Our observation of 

15 hypoxia regulation of both COX-2 and integrin, alpha 5 supports the very recent suggestion that they may 
both function in recovery from cardiovascular injury [Hein et al 2001, Am J Physiol Heart Circ Physiol 
281:H2378-84], which is pre-figured by our claims, Integrin, alpha 5 is induced by hypoxia in mammary 
epithelial cells, and may play an important role in cancer progression in that tissue through its function of 
regulating interaction with the extracellular matrix. 

20 The Oxford BioMedica clone pi C 1 3 represents Jk-recombination signal binding protein. The protein 
sequence encoded by Jk-recombination signal binding protein is represented in the public databases by 
the accession AAA60258 and is described in this patent by Seq ID 381. The nucleotide sequence is 
represented in the public sequence databases by the accession L07872 and is described in this patent by 
Seq ID 382. Hypoxia is an important feature of several diseases, and genes that respond to this stimulus 

25 are therefore implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and 
diagnostic products. Macrophages are key to several diseases involving hypoxia, and contribute to 
inflammatory processes. In these, macrophages are frequently activated by cytokines, which have been 
shown to be present at disease sites, so gene expression responses to both hypoxia and cytokines are 
especially relevant. Jk-recombination signal binding protein is repressed in macrophages activated by 

30 LPS and gamma interferon. TNFalpha is an inflammatory cytokine, which acts on macrophages, and has 
been shown to be central to the pathophysiology and treatment of diseases including rheumatoid arthritis. 
Genes that change in expression in response to TNFalpha therefore have utility in the design of 
therapeutic, prognostic and diagnostic products for such inflammatory conditions. Jk-recombination 
signal binding protein is induced in macrophages activated by TNFalpha. The important role of Jk- 
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recombination signal binding protein in the regulation of the immune response is thus modulated by 
hypoxia, and there are potentially many ways of exploiting that modulation in the design of diagnostics 
and therapeutics. Hypoxia is frequently found in human tumours where macrophage infiltrates are also 
found. In a series of 5 patients with either ovarian or breast cancer, Jk-recom bination signal binding 
5 protein is down-regulated in the malignant tissue as compared to adjacent normal tissue in at least one 
patient. It is of particular interest and significance, in view of the escape from immunological surveillance 
of many tumours, that Jk-recombination signal binding protein is down-regulated. 

The Oxford BioMedica clone plC14 represents Abstrakt. The protein sequence encoded by Abstrakt is 
represented in the public databases by the accession NP_057306 and is described in this patent by Seq ID 

10 383. The nucleotide sequence is represented in the public sequence databases bv the accession 
NM_016222 and is described in this patent by Seq ID 384. Hypoxia is an important feature of several 
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 
utility in the design of therapeutic, prognostic and diagnostic products. Macrophages are key to several 
diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are 

15 frequently activated by cytokines, which have been shown to be present at disease sites, so gene 
expression responses to both hypoxia and cytokines are especially relevant. Abstrakt is repressed in 
macrophages activated by IL-15. TNFalpha is an inflammatory cytokine, which acts on macrophages, and 
has been shown to be central to the pathophysiology and treatment of diseases including rheumatoid 
arthritis. Genes that change in expression in response to TNFalpha therefore have utility in the design of 

20 therapeutic, prognostic and diagnostic products for such inflammatory conditions. Abstrakt is induced in 
macrophages activated by TNFalpha. The general role of Abstrakt in the regulation of gene expression 
[Schmucker et al 2000, Mech Dev 91:1 89-96] implies particular significance to the recovery of cells from 
hypoxic insult. 

The Oxford BioMedica clone plC15 represents High-mobility group protein 2. The protein sequence 
25 encoded by High-mobility group protein 2 is represented in the public databases by the accession 
NP_002i20 and is described in this patent by Seq ID 385. The nucleotide sequence is represented in the 
public sequence databases by the accession NM_002129 and is described in this patent by Seq ID 386. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
30 products. 

The Oxford BioMedica clone plC16 represents Decidual protein induced by progesterone. The protein 
sequence encoded by Decidual protein induced by progesterone is represented in the public databases by 
the accession NP_008952 and is described in this patent by Seq ID 387. The nucleotide sequence is 
represented in the public sequence databases by the accession NM J)07021 and is described in this patent 
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by Seq ID 388. Hypoxia is an important feature of several diseases, and genes that respond to this 
stimulus are therefore implicated in the pathogenesis and have utility in the design of therapeutic, 
prognostic and diagnostic products. Decidual protein induced by progesterone is preferentially induced by 
hypoxia in mammary epithelial cells. Human decidual cells have not been tested, but we predict that 
5 Decidual protein induced by progesterone is hypoxia-regulated in those cells. Macrophages are key to 
several diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are 
frequently activated by cytokines, which have been shown to be present at disease sites, so gene 
expression responses to both hypoxia and cytokines are especially relevant. Decidual protein induced by 
progesterone is repressed in macrophages activated by IL-17. 

10 The Oxford BioMedica clone pi CI 9 represents GM2 ganglioside activator protein. The protein sequence 
encoded by GM2 ganglioside activator protein is represented in the public databases by the accession 
NP_000396 and is described in this patent by Seq ID 389. The nucleotide sequence is represented in the 
public sequence databases by the accession NM_000405 and is described in this patent by Seq ID 390. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 

15 implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. There is a prejudice in the art that the response to hypoxia is generic to all cell types. Contrary 
to this, we show that genes are regulated by hypoxia to a greater degree in certain cell types, 
substantiating their utility in designing specific therapeutic products for diseases involving those cell 
types. Monocytes and macrophages have been implicated in the following diseases involving hypoxia: 

20 rheumatoid arthritis, atherosclerosis, cancer, COPD. GM2 ganglioside activator protein is preferentially 
induced by hypoxia in monocytes or macrophages. The hypoxia-inducibility of this protein in 
macrophages is likely to be clinically very significant. It is likely to play a role in the control of 
inflammation in asthma and inflammatory bowel disease, and in lipid metabolism and 
phosphatidylinositol-mediated signalling. 

25 The Oxford BioMedica clone plC20 represents CN0T8, The protein sequence encoded by CN0T8 is 
represented in the public databases by the accession NPJ)04770 and is described in this patent by Seq ID 
391. The nucleotide sequence is represented in the public sequence databases by the accession 
NM_004779 and is described in this patent by Seq ID 392. Hypoxia is an important feature of several 
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 

30 utility in the design of therapeutic, prognostic and diagnostic products. 

The protein sequence encoded by Similar to Nucleoside phosphorylase is not represented in the public 
databases by a protein accession. The nucleotide sequence is represented in the public sequence databases 
by the accession AA430382 and is described in this patent by Seq ID 394. Hypoxia is an important 
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feature of several diseases, and genes that respond to this stimulus are therefore implicated in the 
pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products. 

The Oxford BioMedica clone plP5 represents SCYA2. The protein sequence encoded by SCYA2 is 
represented in the public databases by the accession NP_002973 and is described in this patent by Seq ID 
5. 395. The nucleotide sequence is represented in the public sequence databases by the accession 
NM J)02982 and is described in this patent by Seq ID 396. Hypoxia is an important feature of several 
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 
utility in the design of therapeutic, prognostic and diagnostic products. Macrophages are key to several 
diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are 

10 frequently activated by cytokines, which have been shown to be present at disease sites, so gene 
expression responses to both hypoxia and cytokines are especially relevant. SCYA2 is induced in 
macrophages activated by LPS and gamma interferon and is also induced in macrophages activated by IL- 
17 or IL-15. Thus the role of SCYA2 in monocyte recruitment (Lu et al 1998, J Exp Med 187:601-8), 
which has clear relevance to the diagnosis and treatment of cardiovascular disease, cancer, rheumatoid 

15 arthritis, atherosclerosis and COPD, is enhanced by hypoxia. TNFalpha is an inflammatory cytokine, 
which acts on macrophages, and has been shown to be central to the pathophysiology and treatment of 
diseases including rheumatoid arthritis. Genes that change in expression in response to TNFalpha 
therefore have utility in the design of therapeutic, prognostic and diagnostic products for such 
inflammatory conditions. SCYA2 is repressed in macrophages activated by TNFalpha. Hypoxia is 

20 frequently found in human tumours where macrophage infiltrates are also found. In a series of 5 patients 
with either ovarian or breast cancer, SYCA2 is down-regulated in the malignant tissue as compared to 
adjacent normal tissue in at least one patient. 

The Oxford BioMedica clone p2L23 represents Endothelin I. The protein sequence encoded by 
Endothelin 1 is represented in the public databases by the accession NP.001946 and is described in this 

25 patent by Seq ID 397. The nucleotide sequence is represented in the public sequence databases by the 
accession NM_001955 and is described in this patent by Seq ID 398. Hypoxia is an important feature of 
several diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and 
have utility in the design of therapeutic, prognostic and diagnostic products. Macrophages are key to 
several diseases involving hypoxia, and contribute to inflammatory processes'. In these, macrophages are 

30 frequently activated by cytokines, which have been shown to be present at disease sites, so gene 
expressipn responses to both hypoxia and cytokines are especially relevant. Endothelin I is induced in 
macrophages activated by LPS and gamma interferon. TNFalpha is an inflammatory cytokine, which acts 
on macrophages, and has been shown to be central to the pathophysiology and treatment of diseases 
including rheumatoid arthritis. Genes that change in expression in response to TNFalpha therefore have 



WO 1)2/46465 PCT/CBI 1 1/05458 

176 

utility in the design of therapeutic, prognostic and diagnostic products for such inflammatory conditions. 
Endothelin 1 is induced in macrophages activated by TNFalpha. Endothelin 1 plays an important role in 
inducing proliferation of vascular smooth muscle cells. Its hypoxia-inducibility and thus its modulation to 
ameliorate the consequences of ischaemic insult, is of considerable clinical significance to the recovery 
5 from injury, and angiogenesis. 

The protein sequence encoded by Similar to Heat shock 70kD protein 4 is not represented in the public 
databases by a protein accession. The nucleotide sequence is represented in the public sequence databases 
by the accession AA633656 and is described in this patent by Seq ID 400. Hypoxia is an important 
feature of several diseases, and genes that respond to this stimulus are therefore implicated in the 
10 pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products. 

The Oxford BioMedica clone pIK9 represents Lipocortin I. The protein sequence encoded by Lipocortin 
I is represented in the public databases by the accession NP_000691 and is described in this patent by Seq 
ID 401. The nucleotide sequence is represented in the public sequence databases by the accession 
NM_000700 and is described in this patent by Seq ID 402. Hypoxia is an important feature of several 

15 diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 
utility in the design of therapeutic, prognostic and diagnostic products. Lipocortin I (also called annexin I) 
is an important anti-inflammatory mediator, and its hypoxia-inducibility has important implications for 
the diagnosis and treatment of ischaemic disease, cancer, atherosclerosis, and inflammatory diseases such 
as rheumatoid arthritis. Hypoxia is frequently found in human tumours where macrophage infiltrates are 

20 also found. In a series of 5 patients with either ovarian or breast cancer, Lipocortin I is up-regulated in the 
malignant tissue as compared to adjacent normal tissue in at least one patient. Hypoxia is frequently 
found in human tumours where macrophage infiltrates are also found. In a series of 5 patients with either 
ovarian or breast cancer, Lipocortin 1 is down-regulated in the malignant tissue as compared to adjacent 
normal tissue in at least one patient. 

,25 The Oxford BioMedica clone plK23 represents MYC. The protein sequence encoded by MYC is 
represented in the public databases by the accession NP_002458 and is described in this patent by Seq ID 
403. The nucleotide sequence is represented in the public sequence databases by the accession 
NM_002467 and is described in this patent by Seq ID 404. Hypoxia is an important feature of several 
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 

30 utility in the design of therapeutic, prognostic and diagnostic products. Macrophages are key to several 
diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are 
frequently activated by cytokines, which have been shown to be present at disease sites, so gene 
expression responses to both hypoxia and cytokines are especially relevant. MYC is repressed in 
macrophages activated by LPS and gamma interferon. Hypoxia is frequently found in human tumours 



WO 02/46465 



PCT/C B0 1/05458 



177 

where macrophage infiltrates are also found. In a series of 5 patients with either ovarian or breast cancer, 
MYC is up-regulated and also down-regulated in the malignant tissue as compared to adjacent normal 
tissue in at least one patient. 

The Oxford BioMedica clone pi K 1 5 represents AIpha-2-macroglobuIin. The protein sequence encoded 
5 by Alpha-2-macroglobulin is represented' in the public databases by the accession NP_000005 and is 
described in this patent by Seq ID 405. The nucleotide sequence is, represented in the public sequence 
databases by the accession NM .000014 and is described in this patent by Seq ID 406. Hypoxia is an 
important feature of several diseases, and genes that respond to this stimulus are therefore implicated in 
the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products. There is 

10 a prejudice in the art that the response to hypoxia is generic to all cell types. Contrary to this, we show 
that genes are regulated by hypoxia to a greater degree in certain cell types, substantiating their utility in 
designing specific therapeutic* products for diseases involving those cell types. Monocytes and 
macrophages have been implicated in the following diseases involving hypoxia: rheumatoid arthritis, 
atherosclerosis, cancer, COPD. AIpha-2-macroglobulin is preferentially induced by hypoxia in monocytes 

15 or macrophages. Hypoxia is frequently found in human tumours where macrophage infiltrates are also 
found. In a series of 5 patients with either ovarian or breast cancer, Alpha-2-macroglobulin is down- 
regulated in the malignant tissue as compared to adjacent normal tissue in at least one patient. 

The Oxford BioMedica clone plK8 represents SCYA4. The protein sequence encoded by SCYA4 is 
represented in the public databases by the accession XPJ08449 and is described in this patent by Seq ID 

20 407. The nucleotide sequence is represented in the public sequence databases by the accession 
XM J)08449 and is described in this patent by Seq ID 408. Hypoxia is an important feature of several 
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 
utility in the design of therapeutic, prognostic and diagnostic products. Macrophages are key to several 
diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are 

25 frequently activated by cytokines, which have been shown to be present at disease sites, so gene 
expression responses to both hypoxia and cytokines are especially relevant. SCYA4 is induced in 
macrophages activated by LPS and gamma interferon and is also induced in macrophages activated by IL- 
17. TNFalpha is an inflammatory cytokine, which acts on macrophages, and has been shown to be central 
to the pathophysiology and treatment of diseases including rheumatoid arthritis. Genes that change in 

30 expression in response to TNFalpha therefore have utility in the design of therapeutic, prognostic and 
diagnostic products for such inflammatory conditions. SCYA4 is induced in macrophages activated by 
TNFalpha. SCYA4 is a chemokine which is likely to be significant in inflammatory disease as a direct 
result of its hypoxic regulation. Hypoxia is frequently found in human tumours where macrophage 
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infiltrates are also found. In a series of 5 patients with either ovarian or breast cancer, SCYA4 is down- 
regulated in the malignant tissue as compared to adjacent normal tissue in at least one patient. 

The Oxford BioMedica clone pi M 24 represents Sex hormone-binding globulin. The protein sequence 
encoded by Sex hormone-binding globulin is represented in the public databases by the accession 
5 NP_001031 and is described in this patent by Seq ID 409. The nucleotide sequence is represented in the 
public sequence databases by the accession NM JJ0I040 and is described in this patent by Seq ID 410. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. 

10 The Oxford BioMedica clone plK7 represents ATP-binding cassette El. The protein sequence encoded 
by ATP-binding cassette El is represented in the public databases by the accession NPJ)02931 and is 
described in this patent by Seq ID 411. The nucleotide sequence is represented in the public sequence 
databases by the accession NM_002940 and is described in this patent by Seq ID 412. Hypoxia is an 
important feature of several diseases, and genes that respond to this stimulus are therefore implicated in 

15 the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products. 
Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory processes. In 
these, macrophages are frequently activated by cytokines, which have been shown to be present at disease 
sites, so gene expression responses to both hypoxia and cytokines are especially relevant. ATP-binding 
cassette El is repressed in macrophages activated by LPS and gamma interferon. 

20 The Oxford BioMedica clone pi K 1 6 represents CCT6A. The protein sequence encoded by CCT6A is 
represented in the public databases by the accession NP_001753 and is described in this patent by Seq ID 
413. The nucleotide sequence is represented in the public sequence databases by the accession 
NM_001762 and is described in this patent by Seq ID 4)4. Hypoxia is an important feature of several 
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 

25 utility in the design of therapeutic/prognostic and diagnostic products. 

The Oxford BioMedica clone p 1 K 1 8 represents Colony-stimulating factorl. The protein sequence 
encoded by Colony-stimulating factorl is represented in the public databases by the accession AAA521 17 
and is described in this patent by Seq ID 415. The nucleotide sequence is represented in the public 
sequence databases by the accession M37435 and is described in this patent by Seq ID 416. Hypoxia is an 
30 important feature of several diseases, and genes that respond to this stimulus are therefore implicated in 
the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products. 
Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory processes. In 
these, macrophages are frequently activated by cytokines, which have been shown to be present at disease 
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sites, so gene expression responses to both hypoxia and cytokines are especially relevant. Colony- 
stimulating factorl is repressed in macrophages activated by LPS and gamma interferon. 

The Oxford BioMedica clone plNl represents GA17. The protein sequence encoded by GA17 is 
represented in the public databases by the accession NPJ)06351 and is described in this patent by Seq ID 
5 417. The nucleotide sequence is represented in (he public sequence databases by the accession 
NMJ06360 and is described in this patent by Seq ID 418. Hypoxia is an important feature of several 
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 
utility in the design of therapeutic, prognostic and diagnostic products. 

The Oxford BioMedica clone plK22 represents GPR44. The protein sequence encoded by GPR44 is 
10 represented in the public databases by the accession NPJ04769 and is described in this patent by Seq ID 
419. The nucleotide sequence is represented in the public sequence databases by the accession 
NMJ04778 and is described in this patent by Seq ID 420. Hypoxia is an important feature of several 
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 
utility in the design of therapeutic, prognostic and diagnostic products. Macrophages are key to several 
15 diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are 
frequently activated by cytokines, which have been shown to be present at disease sites, so gene 
expression responses to both hypoxia and cytokines are especially relevant. GPR44 is repressed in v 
macrophages activated by LPS and gamma interferon. GPR44 is most similar to the chemoattractant 
GPCR's [Marchese et al 1999, Genomics 1999 Feb 1 5 ;56(1 ): 1 2-2 1 ]. Our demonstration of its hypoxic 
20 regulation enables prediction of roles in diseases associated with transient hypoxia and macrophages. 
GPCR's are a druggable class of molecules, and represent an ideal route for pharmacological 
intervention. 

The Oxford BioMedica clone plK14 represents Keratin 6B. The protein sequence encoded by Keratin 6B 
is represented in the public databases by the accession NP_005546 and is described in this patent by Seq 

25 ID 421. The nucleotide sequence is represented in the public sequence databases by the accession 
NMJ05555 and is described in this patent by Seq ID 422. Hypoxia is an important feature of several 
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 
utility in the design of therapeutic, prognostic and diagnostic products. Keratin 6B is induced in 
macrophages treated with the inhibitory cytokine IL-10. Keratin 6B is repressed in macrophages activated 

30 by 1L-17 and is also repressed in macrophages activated by IL-15. 

The Oxford BioMedica clone p 1 K 1 3 represents Lymphocyte adaptor protein. The protein sequence 
encoded by Lymphocyte adaptor protein is represented in the public databases by the accession 
NPJ)05466 and is described in this patent by Seq ID 423. The nucleotide sequence is represented in the 
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public sequence databases by the accession NM _005475 and is described in this patent by Seq ID 424. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. 

5 The Oxford BioMedica clone pi J20 represents Neuro-oncological ventral antigen 1. The protein sequence 
encoded by -Neuro-oncological ventral antigen 1 is represented in the public databases by the accession 
NP_002506 and is described in this patent by Seq ID 425. The nucleotide sequence is represented in the 
public sequence databases by the accession NM_002515 and is described in this patent by Seq ID 426. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
10 implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. Hypoxia is frequently found in human tumours where macrophage infiltrates are also found. In 
a series of 5 patients with either ovarian or breast cancer, Neuro-oncological ventral antigen 1 is down- 
regulated in the malignant tissue as compared to adjacent normal tissue in at least one patient. 

The Oxford BioMedica clone plJ22 represents Neutral sphingomyelinase (N-SMase) activation 
15 associated factor. The protein sequence encoded by Neutral sphingomyelinase (N-SMase) activation 
associated factor is represented in the public databases by the accession NP_00357I and is described in 
this patent by Seq ID 427. The nucleotide sequence is represented in the public sequence databases by the 
accession NM_003580 and is described in this patent by Seq ID 428. Hypoxia is an important feature of 
several diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and 
20 have utility in the design of therapeutic, prognostic and diagnostic products. Neutral sphingomyelinase 
(N-SMase) activation associated factor is induced in macrophages treated with the inhibitory cytokine IL- 
10. Neutral sphingomyelinase (N-SMase) activation associated factor is repressed in macrophages 
activated by IL-17 and is also repressed in macrophages activated by IL-15. We expect activation of of 
Neutral sphingomyelinase (N-SMase) to have an anti-inflammatory effect. This enzyme is known to 
25 modulate the sphingomyelin second messenger cycle, potentially interacting with the oxidative system. 
Our demonstration of hypoxic regulation provides a crucial indication of the benefit of therapeutic 
intervention via sphingomyelinase (N-SMase) for the treatment of inflammatory diseases and diseases 
related to the hypoxic macrophage. 

The Oxford BioMedica clone pi K 1 represents Cyclophilin F. The protein sequence encoded by 
30 Cyclophilin F is represented in the public databases by the accession NP_005720 and is described in this 
patent by Seq ID 429. The nucleotide sequence is represented in the public sequence databases by the 
accession NM J)05729 and is described in this patent by Seq ID 430. Hypoxia is an important feature of 
several diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and 
have utility in the design of therapeutic, prognostic and diagnostic products. Hypoxia is frequently found 
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in human tumours where macrophage infiltrates are also found. In a series of 5 patients with either 
ovarian or breast cancer, Cyclophilin F is up-regulated in the malignant tissue as compared to adjacent 
normal tissue in at least one patient. 

The Oxford BioMedica clone p 1 K 3 represents Pleckstrin. The protein sequence encoded by Pleckstrin is 
5 represented in the public databases by the accession NP_002655 and is described in this patent by Seq ID 
431. The nucleotide sequence is represented in the public sequence databases by the accession 
NM J)02664 and is described in this patent by Seq ID 432. Hypoxia is an important feature of several 
diseases, and genes .that respond to this stimulus are therefore implicated in the pathogenesis and have 
utility in the design of therapeutic, prognostic and diagnostic products. There is a prejudice in the art that 

10 the response to hypoxia is generic to all cell types. Contrary to this, we show that genes are regulated by 
hypoxia to a greater degree in certain cell types, substantiating their utility in designing specific 
therapeutic products for diseases involving those cell types. Monocytes and macrophages have been 
implicated in the following diseases involving hypoxia: rheumatoid arthritis, atherosclerosis, cancer, 
COPD. Pleckstrin is preferentially induced by hypoxia in monocytes or macrophages. Macrophages are 

15 key to several diseases involving hypoxia, and contribute to inflammatory processes. In these, 
macrophages are frequently activated by cytokines, which have been shown to be present at disease sites, 
so gene expression responses to both hypoxia and cytokines are especially relevant. Pleckstrin is induced 
in macrophages activated by LPS and gamma interferon. Hypoxia is frequently found in human tumours 
where macrophage infiltrates are also found. In a series of 5 patients with either ovarian or breast cancer, 

20 Pleckstrin is down-regulated in the malignant tissue as compared to adjacent normal tissue in at least one 
patient. 

The Oxford BioMedica clones p 1J 1 9 and plK2 represent CFFM4. The protein sequence encoded by 
CFFM4 is represented in the public databases by the accession NP_067024 and is described in this patent 
by Seq ID 433. The nucleotide sequence is represented in the public sequence databases by the accession 

25 NM_021201 and is described in this patent by Seq ID 434. Hypoxia is an important feature of several 
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 
utility in the design of therapeutic, prognostic and diagnostic products. There is a prejudice in the art that 
the response to hypoxia is generic to all cell types. Contrary to this, we show that genes are regulated by 
hypoxia to a greater degree in certain cell types, substantiating their utility in designing specific 

30 therapeutic products for diseases involving those cell types. Monocytes and macrophages have been 
implicated in the following diseases involving hypoxia: rheumatoid arthritis, atherosclerosis, cancer, 
COPD. CFFM4 is preferentially induced by hypoxia in monocytes or macrophages. CFFM4 is induced in 
macrophages treated with the inhibitory cytokine IL-IO. It has been suggested recently that CFFM4 is 
associated with mature cellular function in the monocytic lineage and that it may be a component of a 
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receptor complex involved in signal transduction [Gingras et al 2001, Immunogenetics 53:468-76]. Our 
demonstration of hypoxic-regulation opens possible routes of intervention in macrophage-related disease 
via this potentially important cell surface receptor. Hypoxia is frequently found in human tumours where 
macrophage infiltrates are also found. In a series of 5 patients with either ovarian or breast cancer, 
5 CFFM4 is down-regulated in the malignant tissue as compared to adjacent normal tissue in at least one 
patient. 

The Oxford BioMedica clone pi K5 represents Ribosomal protein L36a. The protein sequence encoded by 
Ribosomal protein L36a is represented in the public databases by the accession NP_000992 and is 
described in this patent by Seq ID 435. The nucleotide sequence is represented in the public sequence 
10 databases by the accession NM_001 001 and is described in this patent by Seq ID 436. Hypoxia is an 
important feature of several diseases, and genes that respond to this stimulus are therefore implicated in 
the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products. 

The Oxford BioMedica clone p 1 J 1 7 represents SLC6A I. The protein sequence encoded by SLC6A1 is 
represented in the public databases by the accession NPJ03033 and is described in this patent by Seq ID 

15 437. The nucleotide sequence is represented in the public sequence databases by the accession 
NM_003042 and. is described in this patent by Seq ID 438. Hypoxia is an important feature of several 
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 
utility in the design of therapeutic, prognostic and diagnostic products. Hypoxia is frequently found in 
human tumours where macrophage infiltrates are also found. In a series of 5 patients with either ovarian 

20 or breast cancer, SLC6A1 is up-regulated and also down-regulated in the malignant tissue as compared to 
adjacent normal tissue in at least one patient. 

The^Oxford BioMedica clone p I J 1 8 represents Synaptopodin. The protein sequence encoded by 
Synaptopodin is represented in the public databases by the accession NP.009217 and is described in this 
patent by Seq ID 439. The nucleotide sequence is represented in the public sequence databases by the 

25 accession NM J07286 and is described in this patent by Seq ID 440. Hypoxia is an important feature of 
several diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and 
have utility in the design of therapeutic, prognostic and diagnostic products. Synaptopodin is a component 
of the cytoskelelon which has particular importance in neurons, where it is involved in synaptic plasticity. 
Its hypoxia-regulation is clearly potentially significant in the context of neurological disease. Hypoxia is 

30 frequently found in human tumours where macrophage infiltrates are also found. In a series of 5 patients 
with either ovarian or breast cancer, Synaptopodin is up-regulated and also down-regulated in the 
malignant tissue as compared to adjacent normal tissue in at least one patient. 
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The Oxford BioMedica clone p 1 J 1 5 represents TERA protein. The protein sequence encoded by TERA 
protein is represented in the public databases by the accession NPJJ67061 and is described in this patent 
by Seq ID 441. The nucleotide sequence is represented in the public sequence databases by the accession 
NMJJ21238 and is described in this patent by Seq ID 442. Hypoxia is an important feature of several 
5 diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 
utility in the design of therapeutic, prognostic and diagnostic products. Hypoxia is frequently found in 
human tumours where macrophage infiltrates are also found. In a series of 5 patients with either ovarian 
or breast cancer, TERA protein is up-regulated and also down-regulated in the malignant tissue as 
compared to adjacent normal tissue in at least one patient. 

10 The Oxford BioMedica clone plK4 represents TSC-22. The protein sequence encoded by TSC-22 is 
represented in the public databases by the accession NP_006013 and is described in this patent by Seq ID 
*443.' The nucleotide sequence is represented in the public sequence databases by the accession 
NM_006022 and is described in this patent by Seq ID 444. Hypoxia is an important feature of several 
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 

15 utility in the design of therapeutic, prognostic and diagnostic products. TSC-22 is a transcriptional 
regulator of the leucine zipper class, and its hypoxic regulation is likely to have significant downstream 
effects which may be related to ischaemic disease. Thus it may provide important points of intervention in 
such diseases. Macrophages are key to several diseases involving hypoxia, and contribute to 
inflammatory processes. In these, macrophages are frequently activated by cytokines, which have been 

20 shown to be present at disease sites, so gene expression responses to both hypoxia and cytokines are 
especially relevant. TSC-22 is repressed in macrophages activated by LPS and gamma interferon. 
Hypoxia is frequently found in human tumours where macrophage infiltrates are also found. In a series of 
5 patients with either ovarian or breast cancer, TSC-22 is down-regulated in the malignant tissue as 
compared to adjacent normal tissue in at least one patient. 

25 The Oxford BioMedica clone p2A14 represents an unannotated EST. The protein sequence encoded by 
this EST is not represented in the public databases by a protein accession. The nucleotide sequence is 
represented in the public sequence databases by the accession AA988I10 and is described in this patent 
by Seq ID 446. Hypoxia is an important feature of several diseases, and genes that respond to this 
stimulus are therefore implicated in the pathogenesis and have utility in the design of therapeutic, 

30 prognostic and diagnostic products. The EST represented by Seq ID 446 is induced in macrophages 
treated with the inhibitory cytokine IL-10. The EST represented by Seq ID 446 is repressed in 
macrophages activated by IL-17 and is also repressed in macrophages activated by IL-15. 

The Oxford BioMedica clone p I J23 represents Calgranulin A. The protein sequence encoded by 
Calgranulin A is represented in the public databases by the accession NP_002955 and is described in this 
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patent by Seq ID 447. The nucleotide sequence is represented in the public sequence databases by the 
accession NM .002964 and is described in this patent by Seq ID 448. Hypoxia is an important feature of 
several diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and 
have utility in the design of therapeutic, prognostic and diagnostic products. Calgranulin A, called by its 
5 synonym S100A8, has been cited recently as "wound-regulated" [Thorey et ai 2001, J Biol Chem 
276:358 18-25] which provides less precise support for our prior determination of its hypoxia-regulation. 
In its potential role as a.chemoattractant, it would be an important point of intervention for the modulation 
of inflammatory processes. Macrophages are key to several diseases involving hypoxia, and contribute to 
inflammatory processes. In these, macrophages are frequently activated by cytokines, which have been 
10 shown to be present at disease sites, so gene expression responses to both hypoxia and cytokines are 
especially relevant. Calgranulin A is repressed in macrophages activated by LPS and gamma interferon. 
..Hypoxia is frequently found in human tumours where macrophage infiltrates are also found. In a series of 
5 patients with either ovarian or breast cancer, Calgranulin A is down-regulated in the malignant tissue as 
compared to adjacent normal tissue in at least one patient. 

15 The Oxford BioMedica clone p 1 J2 1 represents Replication factor C large subunit. The protein sequence 
encoded by Replication factor C large subunit is represented in the public databases by the accession 
NP_002904 and is described in this patent by Seq ID 449. The nucleotide sequence is represented in the 
public sequence databases by the accession NM.002913 and is described in this patent by Seq ID 450. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 

20 implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. 

The Oxford BioMedica clone plJ24 represents Signal recognition particle 19kD. The protein sequence 
encoded by Signal recognition particle 19kD is represented in the public databases by the accession 
NP_003126 and is described in this patent by Seq ID 451. The nucleotide sequence is represented in the 
25 public sequence databases by the accession NM_003135 and is described in this patent by Seq ID 452. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. 

The Oxford BioMedica clone pi J16 represents cDNA: FLJ23019 fis, clone LNG00916. The protein 
30 sequence encoded by cDNA: FLJ230I9 fis, clone LNG00916 is not represented in the public databases 
by a protein accession. The nucleotide sequence is represented in the public sequence databases by the 
accession AK026672 and is described in this patent by Seq ID 454. Hypoxia is an important feature of 
several diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and 
have utility in the design of therapeutic, prognostic and diagnostic products. Macrophages are key to 
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several diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are 
frequently activated by cytokines, which have been shown to be present at disease sites, so gene 
expression responses to both hypoxia and cytokines are especially relevant. The cDNA: FLJ23019 fis, 
clone LNG00916 is induced in macrophages activated by LPS and gamma interferon and is also induced 
5 in macrophages activated by IL-15. 

The Oxford BioMedica clone p 1 J2 represents Proteasome subunit, alpha type, 4. The protein sequence 
encoded by Proteasome subunit, alpha type, 4 is represented in the public databases by the accession 
NP_002780 and is described in this patent by Seq ID 455. The nucleotide sequence is represented in the 
public sequence databases by the accession NM J)02789 and is described in this patent by Seq ID 456. 
10 Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. 

The Oxford BioMedica clone plJ9 represents MAFB. The protein sequence encoded by MAFB is 
represented in the public databases by the accession NPJ05452 and is described in this patent by Seq ID 

15 457. The nucleotide sequence is represented in the public sequence databases by the accession 
NM J0546I and is described in this patent by Seq ID 458. Hypoxia is an important feature of several 
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 
utility in the design of therapeutic, prognostic and diagnostic products. MAFB is a transcriptional 
regulator of the leucine zipper type, and is likely to play an important role in the mediation of the hypoxic 

20 response, with attendant relevance to associated diseases. 

The Oxford BioMedica clone plHO represents DNCLI2. The protein sequence encoded by DNCLI2 is 
represented in the public databases by the accession NP J06132 and is described in this patent by Seq ID 
459. The nucleotide sequence is represented in the public sequence databases by the accession 
NM_006141 and is described in this patent by Seq ID 460. Hypoxia is an important feature of several 
25 diseases, and genes that respond to this stimulus are. therefore implicated in the pathogenesis and have 
utility in the design of therapeutic, prognostic and diagnostic products. Hypoxia is frequently found in 
human tumours where macrophage infiltrates are also found. In a series of 5 patients with either ovarian 
or breast cancer, gene X is down-regulated in the malignant tissue as compared to adjacent normal tissue 
in at least one patient. 

30 The Oxford BioMedica clone plJl represents Chromobox homolog 3. The protein sequence encoded by 
Chromobox homolog 3 is represented in the public databases by the accession NP__057671 and is 
described in this patent by Seq ID 461. The nucleotide sequence is represented in the public sequence 
databases by the accession NMJ16587 and is described in this patent by Seq ID 462. Hypoxia is an 
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important feature of several diseases, and genes that respond to this stimulus are therefore implicated in 
the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products. 

The Oxford BioMedica clone plJ5 represents SCYA7. The protein sequence encoded' by SCYA7 is 
represented in the public databases by the accession NP_006264 and is described in this patent by Seq ID 
5 463. The nucleotide sequence is represented in the public sequence databases by the accession 
NM_006273 and is described in this patent by Seq ID 464. Hypoxia is an important feature of several 
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 
utility in the design of therapeutic, prognostic and diagnostic products. Macrophages are key to several 
diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are 

10 frequently activated by cytokines, which have been shown to be present at disease sites, so gene 
expression responses to both hypoxia and cytokines are especially relevant. SCYA7 is induced in 
macrophages activated by 1L-15. SCYA7 is a chemoattractant protein which, considering its hypoxia- 
regulation, is likely to play an important role in inflammatory and ischaemic disease. TNFalpha is an 
inflammatory cytokine, which acts on macrophages, and has been shown to be central to the 

15 pathophysiology and treatment of diseases including rheumatoid arthritis. Genes that change in 
expression in response to TNFalpha therefore have utility in the design of therapeutic, prognostic and 
diagnostic products for such inflammatory conditions. SCYA7 is repressed in macrophages activated by 
TNFalpha. 

The Oxford BioMedica clone plJll represents Fatty-acid-Coenzyme A ligase, long-chain 2. The protein 
20 sequence encoded by Fatty-acid-Coenzyme A ligase, long-chain 2 is represented in the public databases 
by the accession NP_066945 and is described in this patent by Seq ID 465. The nucleotide sequence is 
represented in the public sequence databases by the accession NM_021 122 and is described in this patent 
by Seq ID 466. Hypoxia is an important feature of several diseases, and genes that respond to this 
stimulus are therefore implicated in the pathogenesis and have utility in the design of therapeutic, 
25 prognostic and diagnostic products. Macrophages are key to several diseases involving hypoxia, and 
contribute to inflammatory processes. In these, macrophages are frequently activated by cytokines, which 
have been shown to be present at disease sites, so gene expression responses to both hypoxia and 
cytokines are especially relevant. Fatty-acid-Coenzyme A ligase, long-chain 2 is induced in macrophages 
activated by LPS and gamma interferon and also induced in macrophages activated by IL-17 or IL-15. 
30 Hypoxia is frequently found in human tumours where macrophage infiltrates are also found. In a series of 
5 patients- with either ovarian or breast cancer, Fatty-acid-Coenzyme A ligase, long-chain 2 is down- 
regulated in the malignant tissue as compared to adjacent normal tissue in at least one patient. 

The Oxford BioMedica clone plJ8 represents Programmed cell death 5. The protein sequence encoded by 
Programmed ceil death 5 is represented in the public databases by the accession NP_004699 and is 
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described in this patent by Seq ID 467. The nucleotide sequence is represented in the public sequence 
databases by the accession NM J04708 and is described in this patent by Seq ID 468. Hypoxia is an 
important feature of several diseases, and genes that respond to this stimulus are therefore implicated in 
the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products. 

5 The Oxford BioMedica clone pi 120 represents SCYA3L. The protein sequence encoded by SCYA3L is 
represented in the public databases by the accession CAA36397 and is described in this patent by Seq ID 
469. The nucleotide sequence is represented in the public sequence databases by the accession X52149 
and is described in this patent by Seq ID 470. Hypoxia is an important feature of several diseases, and 
genes that respond to this stimulus are therefore implicated in the pathogenesis and have utility in the 

10 design of therapeutic, prognostic and diagnostic products. There is a prejudice in the art that the response 
to hypoxia is generic to all cell types. Contrary to this, we show that genes are regulated by hypoxia to a 
greater degree in certain cell types, substantiating their utility in designing specific therapeutic products 
for diseases involving those cell types. Monocytes and macrophages have been implicated in the 
following diseases involving hypoxia: rheumatoid arthritis, atherosclerosis, cancer, COPD. SCYA3L is 

15 preferentially induced by hypoxia in monocytes or macrophages. Macrophages are key to several diseases 
involving hypoxia, and contribute to inflammatory processes. In these, macrophages are frequently 
activated by cytokines, which have been shown to be present at disease sites, so gene expression 
responses to both hypoxia and cytokines are especially relevant. SCYA3L is induced in macrophages 
activated by LPS and gamma interferon. TNFalpha is an inflammatory cytokine, which acts on 

20 macrophages, and has been shown to be central to the pathophysiology .and treatment of diseases 
including rheumatoid arthritis. Genes that change in expression in response to TNFalpha therefore have 
utility in the design of therapeutic, prognostic and diagnostic products for such inflammatory conditions. 
SCYA3L is induced in macrophages activated by TNFalpha. 

The Oxford BioMedica clone plJ3 represents Furin. The protein sequence encoded by Furin is 
25 represented in the public databases by the accession NP_002560 and is described in this patent by Seq ID 
471. The nucleotide sequence is represented in the public sequence databases by the accession 
NMJ02569 and is described in this patent by Seq ID 472. Hypoxia is an important feature of several 
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 
utility in the design of therapeutic, prognostic and diagnostic products. 

30 The Oxford BioMedica clone p 1 J 1 2 represents Nuclear autoantigenic sperm protein. The protein 
sequence encoded by Nuclear autoantigenic sperm protein is represented in the public databases by the 
accession NP_002473 and is described in this patent by Seq ID 473. The nucleotide sequence is 
represented in the public sequence databases by the accession NM_002482 and is described in this patent 
by Seq ID 474. Hypoxia is an important feature of several diseases, and genes that respond to this 
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stimulus are therefore implicated in the pathogenesis and have utility in the design of therapeutic, 
prognostic and diagnostic products. 

The Oxford BioMedica clone pi 123 represents Ecotropic viral integration site 2A. The protein sequence 
encoded by Ecotropic viral integration site 2A is represented in the public databases by the accession 
5 NPJ)55025 and is described in this patent by Seq ID 475. The nucleotide sequence is represented in the 
public sequence databases by the accession NM JJ14210 and is described in this patent by Seq ID 476. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. There is a prejudice in the art that the response to hypoxia is generic to all cell types. Contrary 

10 to this, we show that genes are regulated by hypoxia to a greater degree in certain cell types, 
substantiating their utility in designing specific therapeutic products for diseases involving those cell 
types. Monocytes and macrophages have been implicated in the following diseases involving hypoxia: 
rheumatoid arthritis, atherosclerosis, cancer, COPD. Ecotropic viral integration site 2A is preferentially 
induced by hypoxia in monocytes or macrophages. Macrophages are key to several diseases involving 

15 hypoxia, and contribute to inflammatory processes. In these, macrophages are frequently activated by 
cytokines, which have been shown to be present at disease sites, so gene expression responses to both 
hypoxia and cytokines are especially relevant. Ecotropic viral integration site 2A is repressed in 
macrophages activated by LPS and gamma interferon. Hypoxia is frequently found in human tumours 
where macrophage infiltrates are also found. In a series of 5 patients with either ovarian or breast cancer, 

20 Ecotropic viral integration site 2A is down-regulated in the malignant tissue as compared to adjacent 
normal tissue in at least one patient. 

The Oxford BioMedica clone plJ7 represents Sjogren syndrome antigen B. The protein sequence 
encoded by Sjogren syndrome antigen B is represented in the public databases by the accession 
NPJ)03133 and is described in this patent by Seq ID 477. The nucleotide sequence is represented in the 

25 public sequence databases by the accession NM_003I42 and is described in this patent by Seq ID 478. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 
implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. Sjogren syndrome antigen B is preferentially induced by hypoxia in mammary epithelial cells. 
Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory processes. In 

30 these, macrophages are frequently activated by cytokines, which have been shown to be present at disease 
sites, so gene expression responses to both hypoxia and cytokines are especially relevant. Sjogren 
syndrome antigen B is induced in macrophages activated by LPS and gamma interferon. 

The Oxford BioMedica clone pi 121 represents SCYA8. The protein sequence encoded by SCYA8 is 
represented in the public databases by the accession NP_0056I4 and is described in this patent by Seq ID 
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479. The nucleotide sequence is represented in the public sequence databases by the accession 
NM_005623 and is described in this patent by Seq ID 480. Hypoxia is an important feature of several 
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 
utility in the design of therapeutic, prognostic and diagnostic products. Macrophages are key to several 
5 diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are 
frequently activated by cytokines, which have been shown to be present at disease sites, so gene 
expression responses to both hypoxia and cytokines are especially relevant. SCYA8 is induced in 
macrophages activated by LPS and gamma interferon and is also induced in macrophages activated by IL- 
15. 

10 The Oxford BioMedica clone p I II 9 represents GR02. The protein sequence encoded by GR02 is 
represented in the public databases by the accession NP_002080 and is described in this patent by Seq fD 
481. The nucleotide sequence is represented in the public sequence databases by the accession 
NM_002089 and is described in this patent by Seq ID 482. Hypoxia is an important feature of several 
diseases, and genes that respond to this stimulus are therefore implicated in the pathogenesis and have 

15 utility in the design of therapeutic, prognostic and diagnostic products. Macrophages are key to several 
diseases involving hypoxia, and contribute to inflammatory processes. In these, macrophages are 
frequently activated by cytokines, which have been shown to be present at disease sites, so gene 
expression responses to both hypoxia and cytokines are especially relevant. GR02 is induced in 
macrophages activated by LPS and gamma interferon and is also induced in macrophages activated by IL- 

20 17. GR02 encodes a chemokine which is likely to be involved in the inflammatory response. Its induction 
by hypoxia provides a potential route for intervention in diseases related to inflammation and ischaemia. 
Hypoxia is frequently found in human tumours where macrophage infiltrates are also found. In a series of 
5 patients with either ovarian or breast cancer, GR02 is down-regulated in the malignant tissue as 
compared to adjacent normal tissue in at least one patient. 

25 The Oxford BioMedica clone plJ4 represents Small nuclear ribonucleoprotein Dl. The protein sequence 
encoded by Small nuclear ribonucleoprotein Dl is represented in the public databases by the accession 
NP_008869 and is described in this patent by Seq ID 483. The nucleotide sequence is represented in the 
public sequence databases by the accession NM_006938 and is described in this patent by Seq ID 484. 
Hypoxia is an important feature of several diseases, and genes that respond to this stimulus are therefore 

30 implicated in the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic 
products. 

The Oxford BioMedica clone p 1 124 represents GR01. The protein sequence encoded by GROi is. 
represented in the public databases by the accession NP_001502 and is described in this patent by Seq ID 
485. The nucleotide sequence is represented in the public sequence databases by the accession 
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NMJ01511 and is described in this patent by Seq ID 486. GR01 has known chemotactic activity for 
neutrophils. GR01 belongs to the intercrine alpha family of small CXC cytokines. GR01 encodes a 
chemokine which is likely to be involved in the inflammatory response. Its induction by hypoxia provides 
a potential route for intervention in diseases related to inflammation and ischaemia. Hypoxia is an 
5 important feature of several diseases, and genes that respond to this stimulus are therefore implicated in 
the pathogenesis and have utility in the design of therapeutic, prognostic and diagnostic products. 
Macrophages are key to several diseases involving hypoxia, and contribute to inflammatory processes. In 
these, macrophages are frequently activated by cytokines, which have been shown to be present at disease 
sites, so gene expression responses to both hypoxia and cytokines are especially relevant. GR01 is 
10 induced in macrophages activated by LPS and gamma interferon and is also induced in macrophages 
activated by IL-17. Hypoxia is frequently found in human tumours where macrophage infiltrates are also 
found. In a series of 5 patients with either ovarian or breast cancer, GR01 is down-regulated in the 
malignant tissue as compared to adjacent normal tissue in at least one patient. 

The Oxford BioMedica clone pi II 8 represents Selectin L. The protein sequence encoded by Selectin L is 
15 represented in the public databases by the accession NP_000646 and is described in this patent by Seq ID 
487.. The nucleotide sequence is represented in the public sequence databases by the accession 
NM_000655 and is described in this patent by Seq ID 488. Hypoxia is an important feature of several 
diseases, and genes that respond to this. stimulus are therefore implicated in the pathogenesis and have 
utility in the design of therapeutic, prognostic and diagnostic products. Selectin L shedding by leucocytes 
20 is one aspect of the induction of the inflammatory response. Hypoxic-regulation of Selectin L is clearly a 
significant factor in the induction of inflammation following ischaemic insult or in diseases in which 
transient ischaemic conditions occur. Modulation of this induction is one aspect of the present invention. 
Hypoxia is frequently found in human tumours where macrophage infiltrates are also found. In a series of 
5 patients with either ovarian or breast cancer, Selectin L is down-regulated in the malignant tissue as 
25 compared to adjacent normal tissue in at least one patient. 
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TABLES 

TABLE I: Hypoxia-inducible genes identified from clones only derived from the cardiom yobJast library 





SEQ ID 




GENE NAME 


protein 


nucleotide 


Accession 


















Diacylglycerol kinase, zeta 


353 


354 


NMJ03646 


CCR4 associated factor 1 


391 


392 


AF0533I8 


GM2 ganglioside activator protein. 


389 


390 


X62078 


Granulin 


269 


270 


AK000607 


Serine protease 1 1 


355 


356 


Y07921 


High mobility group 2 protein 


385 


386 


M 83665 


Decidual pnrtein induced by 
progesterone 


387 


388 


NMJ07021 


DEAD-box protein abstrakt 


383 


384 


NMJ16222 


IL-I receptor antagonist 


357 


358 


U65590 


KIAA1376 protein 


29 


30 


AB037797 


Hypothetical protein KIAA0127 


31 


32 


D50917 


Hypothetical protein FLJ20308 


33 


34 


AL137263 


EST 


91 


92 


AL390082 


EST 


89 


90 


ALI 17352 


EST 


77 


78 


AW 664 180 



5 
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TABLE 2: Hypoxia-inducible genes identified from clones only derived from the macrophage libraries 





SEQ ID 




GENE NAME 


protein 


nucleotide 


Accession 










Metallothionein-2a 


265 


266 


J00271 


Metallothionein-lh 


239 


240 


X64177 


Metallothionein-IG 


243 


244 


J039IO 


Interleukin 8 


251 


252 


Y00787 


Lactate dehydrogenase A 


223 


224 


NM.005566 


UDP-glncose pyrophosphorylase 2 


347 


348 


NMJ06759 


Enolase 1 


257 


258 


NM.00I428 i 


Enolase 2 


273 


274 


NM.00I975 


Tissue factor / coagulation factor III / 
thromboplastin 


225 


226 


NMJ01993 


proline 4-hydroxylase, alpha polypeptide 1 


231 


232 


NM.0009I7 


proline 4-hydroxylase, alpha polypeptide II 


349 


350 


NM.004199 


N S 1 -binding protein 


359 


360 


NM.006469 


FGF receptor activating protein 1 


363 


364 


AF15962I 


Adenylate kinase 3 


263 


264 


NM.0J3410 


Osteopontin 


267 


268 


XI 3694 


Aldolase C, fructose-bisphosphate 


259 


260 


NM.005165 


Galectin-8 


365 


366 


AF193806 


Regulator of G-protein signalling 1 (BL34) 


375 


376 


S59049 


Polyubiquitin UbC 


377 


378 


AB009010 


Activin A receptor type I 


361 


362 


NM_00l 105 


GIyceraldehyde-3-phosphate dehydrogenase 


253 


254 


NM_002046 


Phosphoglycerate kinase 1 


255 


256 


NM_00029I 


Rab-8b 


373 


374 


NM.016530 


Glucose phosphate isomerase 


367 


368 


NM_000175 


D123 gene product (HT1080) 


369 


370 


U271I2 


Integrin alpha S 


379 


380 


NM .002 205 


Triosephosphate isomerase 1 


261 


262 


NM .000365 


solute carrier family 31 (copper transporters), 


345 


346 


NM .001860 
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member 2 








Jk-recombinalion signal binding protein 


381 


382 


L07872 


N-myc downstream regulated (NDRG1/RTP) 


229 


230 


D87953 


Plasminogen activator inhibitor-l 


235 


236 


Ml 6006 


Dec-1 


371 


372 


NMJ03670 


FUSIN/CXCR4 


331 


332 


NMJ03467 


Hypothetical protein FLJ20500 


25. 


26 


AK000507 


DKFZP564DI 16 protein 


27 


28 


AL050022 


Hypothetical protein FLJI0134 


23 


24 


AK000996 


cDNA FLJ10433 fis NT2RP1000478 


73 


74 


AK001295 ' 


ESTs 


93 


94 


AW250104 


ESTs 


95 


96 


BE382614 


ESTs 


67 


68 


AW 07 1063 


ESTs 


67 


68 


AW964331 


ESTs 


133 


134 


AA612751 


Singleton EST (not in UniGene) 


135 


136 


A 101 86 1 1 



The gene entitled "Jk-recombination signal binding protein" was found to be hypoxia-inducible using 
subtracted cDNA probes for hybridization, but with non-subtracted probes, where the hybridisation is 
quantitative, no signal was detected. This indicates that the gene is probably hypoxra-regulated but the 
5 absolute expression levels are very low. 
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TABLE 3: Hypoxia-inducible genes identified from clones derived from both macrophage and myoblast 
libraries. 





SEQ ID 


Hypoxia/ 


Hypoxia/ 


GENE NAME 


Accession 


protein 


nucleotide 


normoxia 


normoxia 










(macrophage) 


(myoblast) 


Solute carrier family 2, 
member3 


NM_006931 


247 


248 


91.39 


8.23 


Solute carrier family 2, 
member 5 


NMJ03039 


311 


312 


10.75 


2.26 


Adiphopbilin 


NMJ01122 


313 


314 


13.97 


5.10 


Hexokinase 2 


NMJOOI89 


249 


250 


11.50 


6.25 


Slearoyl-CoA desaturase 


AB032261 


351 


352 


3.74 


2.31 














cDNA DKFZp434O071 


AF125392 


75 


76 


2.31 


2.75 


Hypoxia-inducible protein 2 


NMJ13332 


271 


272 


3.62 


5.07 



5 TABLE 4: Hypoxia responses amplified by HIFlalpha overexpression 



Gene Name 


Nucl 


Experimental Condition # 




Seq 
ID 


1 


2 


3 


4 


5 


6 


7 


8 


9 
























Metallolhionein 2A 


265 


I 


0.57 


0.69 


3.33 


3.22 


5.77 


10.37 


2.05 


1.70 


Metallothionein 1G 


244 


1 


0.68 


0.64 


4.23 


4.21 


7.35 


11.03 


3.65 


2.28 


Hypothetical protein hqp0376 


338 


1 


0.79 


0.61 


6.54 


4.44 


9.01 


11.54 


4.17 


3.22 


Novel Metallothionein 


84 


1 


0.95 


0.78 


5.18 


4.36 


8.20 


11.16 


3.48 


2.94 



Legend: Data shown in the average of 4 repeat experiments. Experimental condition is as shown in the 
text. Values represent fold change as compared to untreated cells (condition 1). 
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TABLE 5: Hypoxia responses amplified by EPAS1 overexpression 



PCT/GB01/05458 



Gene Name 


Nucl 


Experimental Condition S 








Seq 
ID 


1 


2 


3 


4 


5 


5 


7 


8 


9 
























cDNA DKFZp586E1624 


66 


1 


0.77 


).67 


1.00 


1.12 


1.58 


0.83 


2.60 


2.49 


Bulyrate response factor 1 


328 


1 


0.74 


0.64 


1.60 


1.64 


1.57 


1.23 


2.19 


3.20 


hypothetical protein FLJ10134 


24 


1 


0.62 


0.53 


2.73 


2.09 


2.80 


2.87 


4.20 


3.65 


EGL nine (C.elegans) homolog 3 


86 


1 


1.34 


0.81 


1.98 


1.90 


2.02 


1.94 


2.81 


3.12 


ER01 (S. cerevisiae)-like 


68 


1 


1.02 


1.30 


4.26 


4.14 


4.76 


4.12 


4.91 


6.44 


hypothetical protein FLJ10134 


24 


1 


0.68 


0.53 


2.03 


1.97 


3.01 


2.46 


3.67 


2.95 



Legend: Data shown is the average of 4 repeat experiments. Experimental condition is as shown in the 
5 text. Values represent fold change as compared to untreated cells (condition 1). 



TABLE 6. Negative hypoxia responses amplified by HIFlalpha / EPAS1 overexpression 



Gene Name 


^ucl 


Experimental Condition # 




Seq 
!D 


1 


2 


3 


4 


5 


5 


7 


8 


























Hypothetical protein CG1-117 


48 


1 


0.83 


0.87 


0.42 


0.42 


0.32 


0.34 


0.33 


0.27 
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Solute carrier family 5, member 3 
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TABLE 15 Genes with increased expression by macrophage activation 









mRNA EXPRESSION 








(experimental condition) 


C Innp 


Qon lit 


r ana M n m n 

ijene i><jnie 


U 1 

SI 


11 


S3 


S4 


S5 


S6 




















)] Ko 




oil A 4 


0.82 


0.40 


1.15 


0.38 


91.4 


68.4 


)Id 10 


IJlllJl 


[nterleukin 8 


0.75 


1.13 


0.47 


0.41 


42.8 


28.1 


pfBIS 


251/252 


Interleukin 8 


0.85 


1.12 


0.44 


0.37 


47.4 


22.5 


pliZI 


AH(\ IA OA 

4/9/480 


SCY A8 


0.54 


0.18 


1.15 


0.32 


19.6 


12.2 


„ 1 10 A 

pi 120 


469/470 


SCYA3L 


0.92 


0.41 


1.00 


0.30 


29.4 


22.8 


p 1 N 1 7 


n moo 

237/238 


COX-2 


0.90 


1.00 


0.84 


0.84 


18.9 


20.3 


pi Jlo 


453/454 


cDNA: FLJ23019 fis, 
clone LNG00916 


0.92 


0.66 


0.91 


1.15 


14.4 


14.9 


p 1 17 


49/50 


Uridine 5 1 monophosphate 
lydrolase 1 


1.13 


0.57 


0.99 


0.52 


17.6 


23.7 


p]B 14 


1C 1 MCI 

251/252 


Interleukin 8 


0.71 


1.20 


0.51 


0.47 


10.1 


21.4 


p 1 E 1 0 


71/72 


cDNA FLJ11041 fis, clone 
PLACE1004405 


0.66 


0.74 1 


1.15 


0.81 


8.30 


12.1 


P2L23 


397/398 


endothelin 1 


1.02 


0.62 


0.74 


0.50 


11.4 


10.1 


p 1 D 1 9 


143/144 


EST 


0.63 


0.52 


1.00 


1.16 


5.46 


4.73 


plK3 


431/432 


!Pleckstrin 


1.14 


0.70 


0.73 


0.54 


6.49 


2.34 


pIC9 


373/374 


RAB-8b protein 


0.95 


0.81 


0.77 


p.94 


5.11 


4.53 


pi 124 


485/486 


GR01 


0.90 


0.72 


0.78 


1.04 


4.69 


2.96 


plG3 


317/318 


B-cell translocation gene 1 


0.70 


1.00 


0.57 


1.14 


3.51 


3.79 


plBl 


243/244 


Metallothionein 1G 


0.51 


1.00 


0.66 


1.85 


2.50 


3.83 


pi JI l 


465/466 


tatty-acid-Coenzyme A 
ligase, long-chain 2 


0.69 


0.51 


1.36 


0.91 


3.07 


2.97 


p I F 1 7 


329/330 


fi protein (candidate of 
metastasis 1) 


0.26 


1.78 


0.16 


0.88 


1.16 


2.59 


plF4 


339/340 


CYP1 


0.60 


1.04 


0.77 


1.15 


2.52 


4.22 


pi D 1 0 


75/76 


Insulin induced protein 2 


0.49 


1.00 


0.48 


1.23 


1.63 


4.96 


plE7 


53/84 


^ovel metallothionein 


0.49 


1.26 


0.70 


III 


1.32 


2.89 


plD24 


117/118 


EST 


0.58 


0.71 


1.00 


1.32 


1.56 


2.59 


p l II 9 


481/482 


GR02 


0.99 


1.00 


0.69 


3.55 


2.65 


2.29 


)iE22 


161/162 


cDNA FLJ13618 fis, clone 
PLACEI010925 


1.19 


0.77 


0.85 


3.51 


3.09 


2.41 


) I rO 


J J f 1 J JO 


hypothetical protein 
iqpUj / 0 


0.44 


1.08 


0.47 


1.06 


1.11 


2.47 


pi J7 


477/478 


Sjogren syndrome antigen 
B 


1.06 


0.74 


3.85 


3.63 


2.65 


2.95 


PIB19 


235/236 


plasminogen activator 
inhibitor, type I 


0.59 


1.29 


0.45 


1.17 


1.46 


3.46 


plF24 


297/298 


Glia-derived nexin 


0.65 


3.68 


3.99 


1.10 


1.62 


1.80 


plP5 


395/396 


SCYA2 


3.94 


3.13 


3.81 


3.42 


2.27 


1.00 


p!A22 


263/264 


Adenylate kinase 3 


3.57 


1.30 


3.58 


1.64 


1.34 


2.74 


plA23 


265/266 


Metallothionein 2A 


3.55 


0.89 


3.95 


1.01 


1.23 


4.38 


plA24 


239/240 


Metallothionein IH 


0.50 


3.84 


1.03 


1.02 


1.08 


1.60 
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p]P3 


221/222 


PDGFB 


0.47 


2.06 


0.31 


1.99 


1.00 


1.49 


plA7 


345/346 


&LC3JA2 


1.10 


0.92 


9.84 


3.93 


2.30 


2.32 


plPH 


91/92/92a 


iSemaphorin 4b 


0.47 


2.52 


3.95 


4.04 


0.96 


3.88 



Legend 

mRN A expression values in the 6 experimental conditions (#1 no cytokines/ normoxia, #2 no cytokines/ 
hypoxia, S3 IL-10/ normoxia, #4 1L-1 0/ hypoxia, #5 LPS/IFN/ normoxia, #6 LPS/IFN/ hypoxia) are shown 
5 as values referenced to the median value of that gene throughout all 6 experimental conditions. 



TABLE 16. Genes down-regulated by macrophage activation 









mRNA EXPRESSION 








(experimental condition) 


Clone 


Seq ID 


Gene Name 


#1 


#2 


S3 


#4 


85 


#6 




















plH 13 


193/194 


EST 


1.35 


1.41 


1.00 


0.91 


0.44 


0.68 


plE4 


125/126 


EST 


1.22 


1.13 


1.09 


3.96 


0.40 


0.40 


plG7 


281/282 


^ST 


1.30 


1.44 


1.01 


1.51 


3.54 


0.82 


plEl 


123/124 


EST 


1.21 


1.64 


0.94 


1.35 


0.51 


0.63 


plDI8 


127/128 


:DNA FLJI3443 fis, clone 
PLACE1002853 


1.61 


2.60 


3.57 


1.33 


0.26 


0.24 


pi 12 


149/150 


:DNA FLJ11302 fis, clone 
PLACE1009971 


2.39 


1.23 


1.07 


0.54 


0.45 


0.43 


plG20 


203/204 


cDNA YO23H03 


1.45 


0.73 


1.60 


1.12 


0.57 


3.44 


plD21 


129/130 


Hypothetical protein 
FLJ22622 


1.41 


1.72 


0.82 


1.26 


0.14 


0.14 


plF8 


J/10 


Hypothetical protein 
KIAA0914 


1.34 


4.14 


3.77 


2.74 


0.13 


3.25 


plD16 


33/34 


Hypothetical protein 
FLJ20308 


1.31 


2.36 


1.00 


1.59 


0.29 


3.67 


plF3 


333/334 


Hypothetical protein 
XPJ17131 


1.63 


2.21 


0.92 


1.00 


0.42 


0.42 


P1DI2 


29/30 


Hypothetical protein 
KIAAI376 


0.89 


2.62 


0.79 


2.07 


0.28 


2.61 


)II4 


53/54 


Hypothetical protein 
HSPCI96 


1.95 


1.06 


1.20 


0.57 


0.63 


3.28 


plD9 


27/28 


hypothetical protein 
DKFZP564D116 


1.63 


0.94 


1.25 


0.96 


0.55 


0.85 


)1F9 


19/20 


Hypothetical protein 
JCIAA0742 


0.94 ' 


3.54 


0.60 


1.74 


0.33 


1.74 


plFll 


323/324 


Hypothetical protein 
LOC51754 


1.67 


1.91 


1.00 


0.86 


0.6O 


0.59 


)1I15 


47/48 


Hypothetical protein CGI- 
117 


1.31 


3.62 


1.86 


1.26 


3.49 


3.76 


plE13 


21/22 


Hypothetical protein 
PRO0823 


1.15 


0.93 


1.15 


1.08 


3.44 


3.24 
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plFlO 


"5/6" 


Hypothetical protein 
DKFZp434P0116 


2.16 


1.05 


1.54 


0.83 


0.83 


0.67 


plDI 


23/24 


Hypothetical protein 
FLJ10134 


0.86 


1.70 


0.61 


2.42 


3.35 


1.61 


p 1 15 


41/42 


Hypothetical protein 
FU10815 


1.49 


1.00 


1.30 


0.83 


0.43 


0.37 


plG13 


293/294 


ABCA1 


1 


1.05 


1.22 


1.07 


0.43 


0.46 


plB9 


313/314 


adipophilin 


1.01 


3.74 


1.32 


2.08 


0.02 


0.2 


plB7 


313/314 


adipophilin 


1.57 


3.44 


0.77 


1.17 


0.21 


0.45 


plB6 


313/314 


adipophilin 


1.24 


2.45 


0.8 


1.51 


0.38 


0.52 


plB8 


313/314 


adipophilin 


1.1) 


1.87 


1.01 


1.04 


0.55 


0.59 


plK7 


411/412 


ATP-binding cassette El 


1.34 


0.74 


1.58 


1.05 


0.62 


0.48 


plJ23 


447/448 


Calgranulin A 


1.21 


0.94 


3.35 


2.8 


0.58 


0.86 


pi K 18 


415/416 


Colony-stimulating factor! 


2.01 


0.84 


1.7 


0.88 


1 


0.74 


p)C2 


331/332 


CXCR4 


1.01 


3.76 


0.46 


1.47 


0.08 


1.13 


p)Cl 


331/332 


CXCR4 


1.05 


3.64 


0.39 


1.63 


0.27 


0.98 


plG 12 


321/322 


Cyclin G2 


0.85 


2.17 


0.6 


1.29 


0.28 


1.33 


plF16 


325/326 


CYP1B1 


1.37 


0.96 


1.67 


0.66 


0.64 


0.48 


p!C7 


369/370 


D123 


1.69 


1.33 


1.1 


0.7 


0.65 


0.83 


plG 17 


315/316 


Early development regnlator 
2 


0.97 


2.47 


1.12 


2.24 


0.29 


0.85 


pll23 


475/476 


Ecotropic viral integration 
site 2A 


1.39 


1.25 


1.11 


1.72 


0.18 


0.22 


pi A 14 


257/258 


Enolase 1 


0.99 


3.22 


1.19 


2.46 


0.13 


0.37 


piAlO 


273/274 


Enolase 2 


1.17 


5.28 


0.59 


3.77 


0.49 


1.08 


plD6 


67/68 


EROl (S. cerevisiae)-like 


0.84 


3.02 


0.97 


2.87 


0.32 


1.58 


plAll 


253/254 


GAPDH 


1.21 


2.41 


0.93 


1.31 


0.32 


0.81 


pi A 1 2 


253/254 


GAPDH 


1.09 


1.97 


1 


1.49 


0.41 


0.96 


plK22 


419/420 


GPR44 


1.24 


1.03 


1.42 


0.93 


0.56 


0.48 


plCl 8 


269/270 


G ran u Ihi 


1.28 


1.59 


0.96 


1.03 


0.56 


0.72 


plCI7 


269/270 


Granulin 


1.58 


1.6 


0.62 


0.41 


0.76 


0.94 


P1A15 


249/250 


Hexokinase-2 


0.89 


3.88 


0.68 


3.11 


0.38 


2.02 


plC13 


381/382 


Jk-recombination signal 
binding protein 


1.11 


1.18 


1.43 


1.98 


0.32 


0.73 


plA8 


223/224 


Lactate dehydrogenase A 


0.7 


• 2.25 


1.4 


1.44 


0.26 


1.32 


plA9 


223/224 


Lactate dehydrogenase A 


0.77 


1.85 


1.15 


1.68 


0.32 


1.19 


plG5 


279/280 


MAX-interacling protein 1 


1.24 


5.5 


0.9 


4.48 


0.34 


0.97 


plD22 


119/120 


MAX-interacting protein 1 


1.2 


3.86 


0.52 


3.44 


0.37 


0.91 


plG 1 8 


211/212 


Mitochondrion sequence 


1.27 


1.12 


1 


1.31 


0.57 


0.77 


plK23 


403/404 


MYC 


1.37 


0.77 


2.39 


1.09 


0.54 


0.35 


plE20 


99/100 


Myo-inositol 
monophosphatase A3 


1.12 


1.28 


1.02 


0.99 


0.48 


0.61 


plB20 


267/268 


Osteopontin 


1.13 


1.58 


0.99 


1.52 


0.1 


0.4 


plF13 


343/344 


Papillomavirus regulatory 
factor PRF-1 


0.98 


5.02 


0.44 


6.79 


0.09 


2.43 


plA13 


255/256 


Phosphoglycerate kinase 1 


1.04 


2.45 


1.23 


1.83 


0.2 


0.9 


plG9 


305/306 


Pl-3-kinase, catalytic, beta 
polypeptide 


1.46 


1.88 


0.75 


1.17 


0.44 


0.47 
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plEl 8 


63/64 


Plexin CI 


1.72 


1.79 


1 


0.85 


0.69 


0.35 


pICll 


377/378 


polyubiquitin 

i j 


1.13 


1.79 


0.79 


1.14 


0.5 


0.84 


P IB3 


231/232 


Proline 4-hydroxyIase, alpha 
polypeptide I ' 


0.94 


1.38 


1.03 


1.58 


0.43 


0.89 


p]B4 


349/350 


Proline 4-hydroxylase, alpha 
polypeptide IJ 


0.9 


1.46 


1.05 


1.41 


0.44 


1 


p!B22 


355/356 


Protease, serine, 1 1 


1.3 


1.1 


1.26 


0.92 


0.64 


0.7 


pICIO 


375/376 


Regulator of G-protein 
signalling 1 


1.42 


1.68 


0.94 


1.55 


0.47 


0.95 


plD3 


95/96 


Serine carboxypeptidase 1 


1.22 


1.07 


1.07 


1.09 


0.33 


0.88 


plF15 


34J/342 


SHB adaptor protein 


1.04 


1.61 


0.94 


1.72 


0.43 


0.54 


plA5 


311/312 


SLC2A5 


0.71 


2.6 


1.06 


2.09 


0.34 


1.09 


plG4 


307/308 


SLC5A3 


1.12 


1.44 


0.93 


1.31 


0.33 


0.62 


plA20 


261/262 


rriosephosphate isomerase 1 


0.97 


2.06 


1.09 


2.24 


0.17 


0.66 


p!DI5 


31/32 


TR1P-Br2 


1.16 


1.4 


1.1 


1.25 


0.47 


0.46 


plK4 


443/444 


rsc-22 


1.44 


1 


1.55 


0.7 


0.6 


0.57 



TABLE 17: Genes responsive to 1L-10 (increased or decreased) but not affected significantly by LPS+1FN 









mRNA EXPRESSION 








(experimental condit 


ion) 


Clone 


Seq ID 


Gene Name 


111 


n 


S3 


U 


85 


U 




















p)H8 


181/182 


ABL 


1.02 


0.96 


6.65 


5.25 


0.86 


0.73 


pi El 5 


107/108 


cDNA YI27F12 


5.48 


0.77 


1.69 


2.45 


0.78 


1.44 


p2A14 


445/446 


EST 


1.06 


0.74 


2.78 


3.09 


1.06 


0.82 


plH6 


191/192 


EST 


1.01 


0.84 


2.47 


2.05 


3.93 


0.83 


plE5 


141/142 


Uepcidin antimicrobial peptide 


0.84 


3.73 


1.91 


1.68 


3.58 


2.16 


p 1 11 2 


151/152 


Hypothetical protein MGC4549 


1.07 


0.67 


2.34 


2.53 


III 


3.74 


plD8 


271/272 


Hypoxia-inducible protein 2 


[).65 


1.00 


1.51 


1.89 


3.71 


2.05 


pi K 14 


421/422 


keratin 6B 


1.03 


3.68 


3.80 


3.28 


0.97 


0.76 


plJ22 


427/428 


Neutral sphingomyelinase (N-SMase) 
activation associated factor 


0.94 


0.79 


5.59 


3.52 


0.91 


1.29 


blGI5 


287/288 


Phosphoglucomotase 1 


0.82 


1.20 


1.83 


1.90 


3.61 


1.05 


blA2 


247/248 


SLC2A3 


0.37 


3.31 


1.00 


3.32 


3.49 


2.63 


bl A3 


247/248 


SLC2A3 


0.39 


2.45 


1.00 


2.65 


3.20 


1.50 


pIK2 


433/434 


CFFM4 


1.30 


3.98 


0.51 


0.59 


1.11 


0.91 


P 1C4 


363/364 


FGF receptor activating protein 1 


1.02 


0.96 


0.50 


0.63 


1.16 


1.31 
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TABLE 20: Genes up-regulated in response to TNFa 
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TABLE 25: Genes down-regulated in response to 1L-15 









Cytokine / % Oxygen 








none 


none 


L-15 


L-15 


Clone 


Gene Name 


SeqID 


20 


0.1 


20 


0.1 
















plH8 


ABL 


J 82 




1.08 


1.22 


0.09 


plC)4 


Absirakl 


384 


1 


0.69 


0.40 


1.24 


plBS 


Adipophilin 


314 




1.08 


0.23 


1.42 


plB7 


Adipophilin 


314 




1.10 


0.30 


2.02 


p)86 


Adipophilin 


314 


I 


0.98 


0.37 


2.39 


plB9 


Adipophilin 


314 




1.46 


0.41 


1.66 


plAI9 


Aldolase C 


260 


1 


0.50 


0.35 


1.30 


plC7 


D123 


1370 




0.53 


0.47 


0.90 


plH6 


EST 


192 




1.46 


1.95 


0.67 


p2A14 


EST 


446 




1.10 


1.08 


0.51 


plG24 


Glycogen synthase 1 


276 




1 45 


3 88 


) 55 

J .J J 


plJ6 


Hypothetical protein 
FLJ10206 


40 




1 28 

1 mi. O 


1 28 

I .60 


0 18 


pi 11 2 


Hypothetical protein 
MGC4549 


152 




1.20 


1.64 


0.28 


plD8 


Hypoxia-inducible protein 2 


272 




1.30 


1.25 


0.62 


pi K14 


Keratin 6B 


422 




1.27 


1.48 


3.11 


plA8 


Lactate dehydrogenase A 


224 




1.31 


3.48 


2.29 


P 1A9 


Lactate dehydrogenase A 


224 




1.95 


0.49 


1.83 


plJ22 


Neutral sphingomyelinase (N- 
SMase) activation associated 
factor 


428 




1.21 


1.51 


0.11 


plB4 


Proline 4-hydroxylase, alpha 
polypeptide II 


350 




0.81 


0.50 


1.34 


|pl A20 


Triosephosphate isomerase 1 


262 




0.81 


3.36 


1.22 



WO 02/46465 



PCT/GBOi/05458 



244 





ION 




co 




— 






CM 


rO 


CM 


CM 


oo 


CO 


vo 




CM 






CO 




CO 
ON 
CO 


-«o- 


CO 
CM 


r — - 


CM 




c-O 
CO 

PJ 




i — 


-^r 


vo 




CM 




OO 


vo 


VO 


"O- 


CO 


On 


co 


CM 


ON 




vo 




CO 


co 


co 
oo 




VO 










vo 




r — 


r» 


vo 




-«3- 


vo 


vo 






CO 






CM 




C~M 


■*o- 






fN 




co 


<o 
£r? 


1 — 

m 


8696' 


o 

<N 


CO 


CO 
cm 


ro 




CO 
CO 


vo 


CO 


CO 


CO 
CO 




ON 

co 


CO 




CO 

1 


X — 
~~~~ 


CO 


— 


cO 


CJ 










CO 




CO 






~— 


" 






co 


CO 


* 




NM 


*~" 


1 






U- 


O 










CQ 


. . 
-> 


CQ 




— ) 


tu 


tu 






-J 


CQ 


CO 


1— 1 












o 


■< 




<: 






Q 






<c 




■<. 


<: 








■< 






■< 








-< 
























































^3 




















































t^j 


£ 


















































| — 3 


































































<o 






vo 


CO 


CO 


CM 




vo 


oo 


CO 


CM 


"«o- 




vo 


oo 


CO 


CM 












-«* 


vo 


OO 




— 






' 


<N 


CM 






Ol 


CO 


CO 


co 




CO 


CO 




























CM 




















































oo 










































































































































o 




ro 


co 


CO 








ON 


CO 

o 


CO 


CO 


CO 


OO 




CM 


■^o- 


VO 


oo 




oo 

CM 


CM 


ON 


CO 


co 








CO 


CO 


VO 


<o 








c*~» 


CO 


1 — 






^O" 










CM 








CO 










CM 




(N 


CO 


II 




CO 






CJ 




vo 


x^- 


e — ^ 




CO 

vo 






oo 


c^~> 




CO 




-sr 


CO 


CO 






CO 




CO 


CO 


r— 








CO 


rM 


ON 






CO 


co 
vo 




CM 




tx3 






CTn 


r~- 






vo 




<L> 


CO 


VO 


r>- 


On 


ON 




ON 


CO 


On 




CO 




ON 




r 5 

Vw> 




OQ 




CQ 


*C 




<o 

1 


< 


CO 


CQ 


tu 


LU 


<: 


<: 


CQ 


<: 


<; 


<c 




CO 


CQ 


CO 




<c 


2 






<: 


<c 




<C 


< 


p- 




CU 


<C 


<: 


<C 


■< 


<: 


<: 


<c 








CU 


•< 


CU 












pa 


CQ 


CJ 


CQ 


CQ 




CQ 


CZ 


o 


<: 




CQ 


CQ 


CJ 


ca 


CQ 


CQ 






CJ 


3: 


CQ 


CQ 






















































RO 






















































o 




























































































»o 












CO 






CO 


to 


I — • 


On 












CM 
















CO 








^<0" 












































cr 




cr 


cr 


«=:■ 












































o 


o 


o 


o 


o 


CO 


QJ> 




aj 


ID 


CL? 


CL> 


a_> 


CD 


CD 


CL> 


<u 


<o 


<o 


CO 


CO 


CO 


CO 


CO 


CO 






CO 


to 


to 


co 




CO 




CO 


to 


CO 


CO 


CO 


CO 


CO 


CO 


co 


co 


to 


to 


co 


co 


CO 


CO 


CO 






co 


co 


to 


to 


X 


c= 




CO 


CO 


CO 




CO 


ea 


CO 


CO 


CO 


CO 


CO 


co 


ca 


CO 


ca 


CO 


CO 




<o 


<o 


<o 


CO 


CO 


o 


o 




CD 


co 


eu 


£- 


<u> 


CL> 


<L> 


CL> 


<L> 


CO 


CO 


CO 


o 


CO 


CO 


CO 


CO 














cx 


CU 














%— 


i— 










l_ 




l_ 










Q. 


o_ 


O. 


O. 


O- 


>1 


co 




CJ 


o 


o' 


O 


c_> 


CJ 


o 




O 


o 


o 


o 


o 


CO 


o 


c_> 


O 




CO 


CO 


CO 


CO 


QJ 


DC 


CO 




cz 


c: 


cr 


. C 


C= 


1= 


cz 


CT 


c: 


cz 


cz 




c= 




cr 


cr 


cr 






Pi 


P£ 


cc: 


P2 








CM 














VO 






































cO 




















































CO 














CM 






































CO 














CO 










































vo 










CO 




oo 














































CO 




















































CM 




oo 


















CM 
CM 




















CO 










tu 




CO 
































CJ 




cu 










cj 




CO 


































LA 


co 


-«0- 

m 






oo 




LA 




o 


CO 


^* 

CO 


CO 
CO 






oa 


oo 

CO 




IBB 






VO 


CO 
CO 








a* 


<o 


**|- 
O. 






CM 




Cu 




-J 


CM 
OO 

CO 




CO 






CO 


CO 








vo 


oo 
CO 










co 


<0 
cm 


IvJ 






CO 
CM 




a> 
c= 




cr 


CO 


CO 
CM 






-< 


CO 
CM 




o r 




CO 
CM 










en 
o 


Z3 


tu 

^ 






»— > 

.-J 




o 


"5 


"co 

o 


PRO 


> — > 


[3 


3tein 






!3 




art 




CO 


FIJI 










o 


tu 








tU 




o 


o 

v. 




tu 


tu 






tu 






* CQ 


> 


Lu 








II fis 


rotein 


otein 






rotein 




00 fis 


O. 

Ol 
CM 


l_ 

<o 
oc 
cr 


•otein 


c= 
"jo 
o 

I — 


cz 
"5 
o 


V— 

O. 
VO 


» tein 


rotein 


rotein 




uciea 


o 
»— 

vo 


tu 
o 


•otein 


rotein 










CL 


CTC 






Oh 










cx 




o. 




o 


o 


o 




cr 




o 




L Q 








LJI3 


tical 


ical 1 


CM 




tical 


vo 


LJ13 


7 ZP586G 


zinc 


"eo 

CJ 


tical 


tical 


-ZP564D 


Q, 
VO 


tical 


tical 




tical 


CO 


tical 




tical 








tu 


othe 


<o 


CM 
CO 


Os 
CO 


co 
o 


oo 

CO 


tu 

<: 


tive 


o 


othe 


othe 


CO 


othe 


othe 




CO 
_cr 

O 


'KFZPi 


CO 

-cr 
o 


"cL 

o 


othe 




IT 




Q 

o 


o. 


o 
o. 




< 


O- 

>-l 




2: 

Q 




CO 
=3 




o. 


o 

>■ 








Q 

>- 




o 




■ c 

- > 


















EC 




c_> 


o 


CU 






3= 




S 




Ou 




a: 











SUBSTITUTE SHEET (RULE 26) 



WO 02/46465 



PCT/GB01/05458 



245 



I — 

ON 

r— - 
CD 

2. 


On 
co 
VO 

CD 

1 


NMJ16489 


D25216 


NM_016464 


AK002158 


AL0223II 


AK023078 




A L 1 10163 


AF035307 


AL110152 


AK024224 


AL1 37473 


AK00I903 


AK00I295 


AF125392 


AK026966 


R62339 


AA489477 


R0660I 


R00332 


AA463469 


H56028 


AA293300 


AW250104 


BE3826I4 


H59618 


AA449703 


AA5213II 


W69170 | 


vo 


CO 


CD 


CNl 

to 


m 


vo 
VO 


CO 

to 


CD 

VO 




CNJ 

VO 


vo 


vo 
vo 


CO 
vo 


CD 


CNJ 




vo 


oo 


CD 

OO 


CN» 

CO 


oo 


vo 

OO 


oo 
oo 


CD 
ON 


CM 
ON 


ON 


vo 

ON 


OO 

ON 


10.0 


CM 
CD 


CD 


NPJ60387 


NP_057475 


NPJ57573 


BAA04946 


oo 
-^r 
to 
r~- 
to 

CD 

1 

o- 
2. 


BAA92II5 


CAB62980 


BAB 14393 




none 


none 


none 


none 


none 


none 


none 


none 


none 


none 


none 


none 


none 


none 


none 


none 


none 


none 


none 


none 


none 


none 


to 


t — 


ON 




CO 

vo 


to 
to 


t — 
to 


ON 

to 




vo 


CO 

vo 


to 

vo 


VO 


ON 
VO 


r — 


CO 

r— 


to 
1 — 


i — 

r — 


ON 

r — 


CO 


CO 

oo 


to 

OO 


i — 

oo 


ON 
OO 


ON 


CO 

ON 


to 

ON 


ON 


ON 
ON 


CD 


CO 
CD 


Repression 


depression 


Repression 


depression 


depression 


Repression 


Repression 


Repression 




Increase 


Increase 


Increase 


Increase 


Increase 


Increase 


Increase 


Increase 


Increase 


Increase 


Increase 


Increase 


Increase 


Increase 


Increase 


Increase 


Increase 


Increase 


Increase 


Increase 


Increase 


Increase 


hypothetical protein FLJ2064 1 


Hypothetical protein HSPCI 11 1 


hypothetical protein LOC5I251 1 


KIAA00I4 i 


Hypothetical protein HSPCI96 


Hypothetical protein FLJI1296 


Hypothetical protein b A395L 1 4 


cDNA FLJI30I6 fis, clone NT2RP3000624 




cDNA DKFZp586H0324 clone DKFZp586H0324 


Clone 23785 


cDNA DKFZp586E1624 


cDNA FLJ14I62 fis, clone NT2RM4002504 


cDNA DKFZp434EI723 (clone DKFZp434E 1 723) . 


cDNA FIJI 1041 fis, clone PLACEI004405 


cDNA FLJ10433 fis NT2RP1000478 


cDNA DKFZp434O07l 


cDNA FLJ233I3 fis, clone HEPI 1919 


ESTs 


ESTs 


ESTs 


ESTs 


us 
E— 
OO 


or 

f- 

OO 

ix: 


PSTq 


E— ■ 

oo 
u: 




CO 

E- 

oo 

u: 


6/3 

E— 1 
oo 


or; 

E— 
oo 
UJ 


EST* 



SUBSTITUTE SHEET (RULE 26) 



WO 02/46465 



246 



PCT/GB01/05458 



R5I835 


H87770 


R69248 .1 


T68844 


AA454I77 


W) 

VO 
o-J 
CO 

<c 


T73780 


AA40I496 


AA489636 


AA446361 1 


AA93I4JI 


R24223 


R22252 


AA6I275I 


AW96433I 


A 1018611 


AA45I886 


R06520 


T48278 


R68736 




AK024090 


AL05002I 


AK002I64 


AK00II71 


AK022731 


AA420992 


f — 
r — 

CO 
ON 

vo 

<c 
<: 


AA456437 


AA429367 


AA434382 | 


VO 

CO 


CO 

<o 


CD 






VCP 


CO 


CO 


oo 
oo 


oo 


VO 
OO 


CO 

oo 


CO 
CO 


oo 

CO 


CO 


vo 

CO 


oo 

CO 


CO 


OO 






VO 


CO 


CO 
»o 


oo 
in 




vO 
to 


oo 
tn 


CO 
vo 


CO 
vo 


vo 


none 


none 


none 


none 


none 


none 


none 


^one 


none 


none 


none 


none 


none 


none 


none 


none 


none 


none 


none 


none 




none 


none 


none 


none 


none 


none 


none 


none 


none 


none 


m 


co 


On 
CO 






uo 




ON 


oo 


CO 

oo 


oo 


t — 

oo 


ON 
OO 


CO 


CO 

r»~j 


ro 


c — ■ 

CO 


ON 
CO 




CO 




to 


r— 


ON 


VO 


CO 
m 


to 
to 


to 


ON 

to 


vo 


CO 
VO 


















































\ 














ncrease 


Increase 


Increase 


[ncrease 


Increase 


Increase 


Increase 


Increase 


Increase 


Increase 


Increase 


Increase 


Increase 


Increase 


Increase 


Increase 


Increase 


Increase 


Increase 


Increase 




Repression 


Repression 


Repression 


Repression 


Repression 


Repression 


Repression 


Repression 


Repression 


Repression 


to 

r- 

oo 


ESTs 


ESTs 


ESTs 


to 

f— 

CO 

to 


ESTs 


ESTs 


ESTs 


ESTs 


ESTs 


to 

t— 

CO 

wl 


ESTs 


ESTs 


ESTs 


ESTs 


ESTs 


ESTs 


ESTs 


ESTs 


ESTs 




cDNA FLJI4028 fis, clone HEMBAI003838 . 


r.DNA nKF7nSfi4D016 (clone DKFZo564DOI6) 


cDNA FLJI 1 302 fis. clone PLACE100997I 


NEDO FLJ10309 fis cl NT2RM2000287 


Isp.niiP.nrft from clnne RPI 1 -39402 on ch 20 


Ip.STs 


RST« 


ESTs 


ESTs 


ESTs 



SUBSTITUTE SHEET (RULE 26) 



WO 02/46465 



247 



PCT/GB01/05458 



AA664228 


R44397 


AA923509 


W87747 


CO 

vo 
to 
cO 
r. — 

-a: 


T98529 


AA022679 


H 1792 1 


R00766 


W9I958 


R63694 


AA425386 


AA9099I2 


T99032 


H52503 


A A ! 2701 7 


R38647 


T87233 


AAI3035I 


H49601 


AA598952 


AA99I868 


T60III 


AA897090 


AA679939 


AA630I67 




NMJ04331 


91S900 WN 


NMJ02608 


NMJ05566 I 


VO 
vo 


CO 

VO 


CO 


cm 

r- 




VO 

1 — 


CO 

I — 


CO 
OO 


CM 
CO 


CO 


vo 

CO 


CO 
CO 


CO 
ON 


CM 

ON 


ON 


VO 

ON 


oo 

ON 


<o 

CO 
cm 


CM 

cO 


cO 
CM 


vo 

CO 
CM 


CO 

<o 


CO 

CM 


CM 
CM 


CM 


vo 

CM 




oo 

O-J 


CO 
CM 
CM 


CM 
CM 
CM 


fNj 

CM 


none 


a> 
c= 
o 
t= 


none 


none 


c= 
o 
c= 


none 


none 


none 


none 


none i 


none 


none 


none 


none 


none 


none 


none 


none 


none 


none 


none 


none 


none 


none 


none i 


none 




NP 004322 


NPJ06507 


NP.002599 


NP.005557 


VO 


1 — • 

VO 


Ov 
VO 


r — 


cO 
1 — 


r — 


r— 


ON 

r— 


CO 


CO 
CO 


to 

CO 


c — 

CO 


OS 
CO 


ON 


CO 
ON 


to 

ON 


i — 

ON 


ON 
ON 


CO 

CM 


CO 

<o 

CM 


to 
CO 


CO 
CM 


ON 
CO 
CM 


CM 


CO 
CM 


«Oi 
CM 




CM 


ON 
CM 


CM 
CM 


CO 
CM 
CM 


Repression 


Repression 


Repression 


Repression 


Repression | 


Repression 


Repression 


Repression 


Repression 


Repression 


Repression 


Repression 


Repression i 


Repression 


Repression 


Repression \ 


Repression 


Repression 


Repression 


Repression 


Repression 


Repression 


Repression 


Repression 


Repression 


Repression 




Increase 


Increase 


Increase 


Increase 


ESTs 


ESTs 


ESTs 


ESTs 


ESTs 


CO 

t- 

CO 


ESTs' 


ESTs 


ESTs 


ESTs 


ESTs 


ESTs 


ESTs 


ESTs 


ESTs 


ESTs 


ESTs 


ESTs 


ESTs . 


ESTs 


ESTs 


ESTs 


ESTs 


ESTs 


ESTs 


ESTs 




BCL2/adenovirus ElB 19kD-interacting protein 3-Iike 


ISnlute carrier familv 2. member I 


IPDGF hetn 


lactate dehydrogenase A 



SUBSTITUTE SHEET (RULE 26) 



WO 02/46465 



PCT/G BO 1/05458 



248 



NMJ0I993 


NM 003376 \ 


960900 m 


NM.0009I7 


NMJ04052 


M 16006 


U04636 


X64834 


AJOl 1772 


J039I0 


M 10942 


M 20681 


Z46376 


Y00787 


NM 002046 


NM J00291 


NMJ0I428 


NMJ05I65 


- 

to 

VO 

CO 

<o 
<o 

Os 

z. 


NM 013410 


J00271 


XI3694 


AK000607 


OJ 
ro 
CO 

Z 


NM 001975 


U32573 


NM_00I627 


NM_005962 


NM 006311 


U49835 


NMJ0II86 | 


vo 

OJ 

oJ 


CO 
CM 


o 

cO 
OJ 


OJ 
CO 

OJ 


*«d- 

OJ 


vO 
CO 
OJ 


oo 

CO 
OJ 


-«d- 

OJ 


OJ 
OJ 


rJ 


VO 
OJ 


oo 

OJ 


CD 
VO 
OJ 


OJ 

to 

OJ 


^d- 
to 

OJ 


vo 
to 

OJ 


CO 

to 

OJ 


CO 
vo 

OJ 


OJ 

vo 

OJ 


vo 
OJ 


vO 
vO 
OJ 


CO 
VO 
OJ 


o 
r- 

OJ 


OJ 

r- 

OJ 


o- 

OJ 


vo 
o- 

OJ 


oo 
r— 

OJ 


CO 
CO 
OJ 


OJ 

oo 

OJ 


co 

OJ 


vo 

CO 
OJ 


NP.001984 


NP_003367 


NPJ06087 


NP 000908 


NPJ04043 


AAA60003 


AAA573I7 


CAA46046 


P80297 


AAA59873 


AAA59587 


AAB6I083 


CAA865II 


CAA68742 


NP 002037 


NPJ00282 


NPJ0I4I9 


NP.005I56 


NPJ00356 


NP.037542 


AAA59583 


CAA3I984 


AAA58617 


NPJ37464 


996100 dN 


AAB60385 


819100 dN 


NP 005953 


NP_006302 


AAC50597 


NPJ01I77 


OJ 

oj 


r- 

OJ 


ON 

OJ 
OJ 


CO 
OJ 


CO 

m 

CM 


to 

CO 
OJ 


r— 
ro 

OJ 


On 
ro 
OJ 


OJ 


OJ 


vo 

-d- 
OJ 


t — 

-«d- 

OJ 


On 
OJ 


vo 

OJ 


co 
to 

CJ 


to 
to 

OI 


o- 

tO 
OJ 


ON 

to 

OJ 


VO 
OJ 


CO 

vo 

OJ 


to 
vO 
OJ 


t — 
vo 
OJ 


ON 
vo 
OJ 


r— 

OJ 


CO 

r— 

OJ 


to 
o- 

OJ 


o- 
r- 

OJ 


On 

r — 

OJ 


oo 

OJ 


co 

CO 
OJ 


to 

CO 
OJ 


ncrease 


[ncrease 


increase 


Increase 


Increase 


Increase 


Increase 


Increase 


Increase 


Increase 


Increase 


Increase 


Increase 


Increase 


Increase 


Increase 


Increase 


Increase 


Increase 


Increase 


Increase 


Increase 


Increase 


Increase 


Increase 


Increase 


Increase 


Increase 


Increase 


Increase 


Increase 


Tissue factor 1 


Vascular endothelial growth factor 


RTP/NDRGI 


cd 

Q- 

cd 

CO 
co 
cd 

>-. 

X 

o 

I— 

-o 

>^ 

o> 
cz 

*o 

CL 

cz 

CO 

oo 
cd 

"o 

o 
o 

I— 


|BCL2/adenovirus E IB-interacting protein 3 


Plasminogen activator inhibitor, type I 


Cyclooxygenase 2 


Metallothionein IH 


Metallothionein )L 


Metallothionein -IG 


Metallothionein 1 E (functional) 


Solute carrier family 2, member 3 


Hexokinase 2 


Interleukin 8 


Glyceraldehyde-3-phosphate dehydrogenase 


Phosphoglycerate kinase 1 


Enolase 1 


aldolase C, fructose-bisphosphate (ALDOC) 


Triosephosphate isomerase 1 (TPII) 


Adenylate kinase 3 (AK3) 


cd 
OJ 

cr 
"co 

O 

■S 
o 

cd 


cr 

c= 
o 
o 
o 

GO 

O 


Granulin 


Hvnnxia-inducihle nrotein 2 


Ennlase 2, (pamma, neuronal) 


Glvcopen synthase 1 (muscle) 


Activated leucocvte cell adhesion molecule 


M A X-interactinp nrotein 1 


Nuclear recentnr co-renressor 


OJ 

CO 
J^. 

CO 

co 

CO 

cd 
cr 

-cz 


BACH1 transcription factor 



SUBSTITUTE SHEET (RULE 26) 



WO 02/46465 



249 



PCT/G BO 1/05458 



co 
co 

CM 
CD 
CD 

1 


A F 1 5 1 867 


VO 
OO 
CO 

CD 
CD 

^ 

Z. 


NM JJ05502 


AJI3I244 i 


NM 005878 


U83I92 


NM 005079 


NM 004064 


NM 006219 


CO 
CO 
ON 
VO 
CD 
CD 

1 

Z 


NM 004288 


M 5553 1 


NM 001122 


NMJ04427 


NM JJ0I73I 


NM JH 6224 


NM J04354 


NM 016446 


NM .000104 


NM 004926 


NM_0I 2385 


r — 

NO 

CO 

CD 
CD 

^' 

z. 


U79745 


NMJ068I3 


AF083255 


AB005989 


X75342 


NMJI8660 


NM_00l 860 


NM.006759 | 


CO 
OO 


CD 
On 
*-o 


CM 

o> 




VO 

ON 


OO 

ON 


CD 
CD 


CM 
CD 


CD 


NO 

CD 


CO 
CD 


CD 


CM 




vo 


CO 


CD 
CM 


CM 
CM 


CM 


NO 
CM 


CO 
CM 


CD 
CO 


CM 
CO 


CO 


NO 
CO 


CO 

CO 


CD 


CM 




NO 


CO 


.002624 : 


D34I04 : 


003855 


005493 


A10334 


.005869 


C52II3 


005070 


.004055 


.006210 


.008864 


.004279 


A52570 


.001113 


.004418 


.001722 


057308 


.004345 


.057530 


S60000" 


004917 


t — 

to 

vo 

CO 

CD 

1 


.003458 


C52014 


.006804 


C32396 


A22656 


A53091 


.061130 


001851 


006750 


NP. 


VV 


NP 


NP. 


o 


NP 


AA 


NP 


NP 


NP 


NP 


NP 


VV 


NP 


NP 


NP. 


NP. 


NP. 


NP. 


NP 


NP 


NP 


NP 


VV 


NP. 


VV 


BA 


CA 


NP. 


NP. 


NP. 


CO 
CM 


ON 

CO 
OJ 


On 
CM 


co 

ON 

CM 


to 

ON 
CM 


x — ■ 

ON 

CM 


ON 
ON 

CM 


CD 

CO 


CO 
CD 

CO 


to' 

CD 

CO 


r— : 

CD 

CO 


ON 

CD 

CO 


CO 


CO 
CO 


in 

CO 


t— - 
CO 


ON 
CO 


CM 

CO 


CO 
CM 


to 

CM 
CO 


i — 

CM 
CO 


ON 
CM 
CO 


co 

CO 


CO 
CO 
CO 


to 

co 

CO 


r— 

CO 
CO 


ON 
CO 
CO 


co 


CO 
CO 


to 

co 


r— ■ 
co 


ease 


ease 


ease 


ease 


ease 


-ease 


rease 


rease 


rease 


rease 


rease 


rease 


rease 


rease 


rease 


rease 


rease 


rease 


rease 


rease 


rease 


rease 


rease 


rease 


rease 


rease 


rease 


rease 


rease 


rease 


rease 


ncr 


ncr 


[ncr 


[ncr 


Inci 


Inci 


Inci 


Inci 


Inci 


Inci 


Inci 


Inci 


Inci 


Inci 


Inci 


Inci 


Inci 


Inci 


Inc: 


Inci 


Inci 


Inci 


Inci 


Inc 


Inc 


Inci 


Inci 


Inci 


Inci 


Inci 


Inci 


















































CM 














Phosphoglucomutase 1 


CGM09 protein 


|SAP30 


ATP-binding cassette transporter- J 


SEC24 protein 


Trinucleotide repeat containing 3 


Post-synaptic density protein 95 


Tumor protein D52 


Cyclin-dependent kinase inhibitor p27kipl 


phosphoinositide-3-kinase, catalytic, beta 


Solute carrier family 5, member 3 


PSCDBP 


Solute carrier family 2, member 5 


Adipophilin 


Early development regulator 2 


B -cell translocation gene 1, 


SH3PXI 


Cyclin G2 


NAG-5 protein 


Cytochrome P450 IB I (dioxin-inducible) 


Butyrate response factor 1 


d8 orotein (candidate of metastasis 1) 


chemokine (C-X-C motif), receptor 4 (CXCR4) 


solute carrier familv 16, member 6 


Proline-rich Drotein with nuclear targeting signal (B4- 


RNA helicase-related orotein 


Cvtochrome P450, subfamily XXVIIB, polypeptide 1 


SHB adaotor orotein 


Paoillomavirus regulatory factor (PRF-I) 


SLC31A2/hCTRI 


UDP-glucose pyrophosphorylase 2 (UGP2) 



SUBSTITUTE SHEET (RULE 26) 



WO 02/46465 



250 



PCT/GB01/05458 



ON 
ON 

^d- 

CD 
CD 

:> 


AB03226I 


NM 003646 


Y0792I 


U65590 


NM.006469 


NMJJ0II05 


AFI5962I 


AF193806 


NMJ00I75 


U271 12 


NM 003670 


NMJI6530 


S59049 


AB009010 


NM 002205 


L07872 


NM 016222 


M83665 


NM 007021 


DO 

r*- 

CD 

CNl 

vo 
X 


AF053318 




CD 

r— 

CD 
CD 
CD 

2: 


On 

1 


m 

LO 
CTN 

i 


LI2723 


ZD 
CD 

CD 
CD 
CD 


CD 
»n 

rO 


uo 
CO 


rO 


vo 

tO 
CO 


CO 
LO 
CO 


CD 
vo 

CO 


CNJ 

vo 

CO 


vo 

CO 


VO 
vo 

CO 


CO 
VO 
CO 


CD 

r— 

CO 


CM 
I 

CO 


r- 

co 


vo 
! — 
CO 


CO 

r— 

CO 


CD 
CO 
CO 


OJ 
CO 
CO 


co 

CO 


vo 

CO 
CO 


CO 
CO 
CO 


CD 

ON 
CO 


CN» 

ON 
CO 




ON 
CO 


VO 

ON 
CO 


CO 
ON 
CO 


CD 
CD 


Ol 
CD 
-<rj- 


NPJ04I90 


BAA93510 


X — 

rn 
vo 

CO 

CD 
CD 

P- 

2: 


BAAI 3322 


AAB92268, 
AAB92269, 
A AB92270 


NPJ06460 


960100 m 


AAFI9I56 


AAFI9370 


991000 dN 


AAC34738 


NPJ03661 


VO 

r— 
to 

CD 
P- 


AAB26289 


BAA23632 


NPJ02I96 


AAA60258 


NP.057306 


AAA58659 


NP 008952 


CAA43993, 
CAA43994 


AAD02685 




NP 000261 


NP 002973 


NPJ01946 


AAA02807 


INPJ00691 


ON 
*<^- 
<-0 


<-o 


CO 
tO 
CO 


to 

CO 

CO 


to 

CO 


ON 

to 

co 


vo 

CO 


CO 
vo 

CO 


to 

VO 
CO 


r— 
vo 

CO 


ON 

VO 
CO 


1 — ■ 

CO 


CO 
CO 


to 
l — 

CO 


CO 


ON 
CO 


CO 
CO 


CO 
CO 
CO 


to 

CO 

CO 


co 

CO 


ON 
CO 
CO 


ON 

CO 




CO 
ON 
CO 


to 

On 

CO 


l — 

On 

CO 


ON 

Ov 

CO 


CD 
•«cf 


ncrease 


ncrease 


ncrease 


ncrease 


Increase 


Increase 


Increase 


Increase 


Increase 


Increase 


Increase 


Increase 


Increase 


Increase 


Increase 


Increase 


Increase 


Increase 


Increase 


Increase 


Increase 


Increase 




Repression 


Repression 


Repression 


Repression 


Repression 


iProline 4-hvdroxvlase, alpha polypeptide II 1 


Stearovl-CoA desaturase 1 


Diacyl&lycerol kinase, zeta i 


Serine protease 1 1 


IL-1 receptor antagonist, alternatively spliced forms 


NSI -binding protein 


Activin A receptor type I 


FGF receptor activating protein 1 (FRAGl) 


Galectin-8 


Glucose 6-phosphate isomerase 


DI23 protein 


Decl. 


OO 

-o 

CO 

Pi 


BL34 


Polvubiauitin UbC 


Infeprin alnha 5 


Ik-renomhination signal bindine orotein 


CCJ 
i— 

CO 

_D 

o ca 
c 
'5 
c 

1— 

c 

X 

c 

< 


Hioh mohilitv aroun 2 DTOtein 


o 
t= 
o 

OO 
OJ 

Cxi 
o 
o 
> 

JO 

-a 

L CJ 

U 

c 

* — 

o .- 

a: 

■> -*— 

. c 

• c 

* c; 
'. 

c 

r a. 

: c 


GM2 ganglioside activator protein. 


< 

CJ 

I— 

c 

CJ 
ec 

t+- 

TZ 
0. 

CC 

C 

c 

is 
c 

CC 

PS 

C- 

c 


> 

> 
) 
) 

3 

J 

) 
> 


CJ 

cr 
ca 

C 
_c 
c 

cv 
C 

c 

CL 

-c 

o- 

c 
_a. 

c 

2 


MnnnrvtP rhpmntar.fir. nrotein 1 


► s 

a. 

» JC 

! c 

;3 


Ppof chnrlf 7nirn nrftfftin 4 


Annexin AI 



SUBSTITUTE SHEET (RULE 26) 



WO 02/46465 



251 



PCT/GB01/05458 



CO 
ON 



to to 



0> 
Q- 



-<=r 



o> 

C3 



no 



»o 
on 
»o 

-< 



ON 



-O 



ca- 
po 

o, 

tx3 



<: 
o 



E 

CN> 



o> 
ex. 



IO 



vO 



O 

l 



ON 
ON 

O 



ON 
VO 

ro 



2: 



a: 



c 




-c 




c 


> o 


c 




c 


> Q. 


*c 


^ -< 


p: 
c 


2 o£ 


> 


; pj 




3 r- 



Q 



Q 

ON 



ON 



E 



o • 



O 



SUBSTITUTE SHEET (RULE 26) 



WO 02/46465 PCT/GBO 1/05458 

252 



NM.006273 


DI0040 


NMJ04708 


M23452 


NMJ04374 


c-M 
oo 
-d- 
cm 
c=> 
CO 

2: 


NM.OI42I0 


J04205 


NM 005623 


NM JJ02089 


NMJ06938 | 


cO 

co 


to 
t/~k 
vo 
CO 
CO 
CO 

1 


vO 


VO 
vO 


CO 
VO 


CP 

1— 


i — ■ 




vo 
r- 
xr 


oo 
r- 


co 
oo 

XT 


CM 

oo 


oo 

XT 


vO 
oo 


CO 
CO 

XJ- 


NP 006264 


BAA00931 


NPJ04699 


vo 

co 
VO 

C"> 


NPJ04365 


NP.002473 


NPJ55025 


AAA51885 


NPJ056I4 


NPJ02080 


NPJ08869 


NP.00I502 


vo 
X*" 
vo 
CO 

CO 

<o 

t 

p- 


n 
vo 


to 

vO 


r— 
vo 


OS 

vo 




1 — 


to 
r— 


1 — 

r— 


r- 


oo 


m 
oo 


»o 
oo 


r- 
oo 


Repression 


Repression 


Repression 


Repression. 


Repression 


Repression 


Repression 


Repression 


Repression 


Repression 


Repression 


Repression 


Repression 
























recursor 






CM 
















CO 
CM 




Cl 

CM 






CZ 




















cr 






long-chai 


Programmed cell death 5 / TFAR19 




Vic 




*< 

CM 




CM 


ory protei 


nSM D 1 


lory protei 




Monocyte chemotactic protein 


Fatty-acid-Coenzyme A ligase, 


Small inducible cytokine A3 


Cytochrome c oxidase subunit 


NASP histone-binding prot. 


Ecotropic viral integration site 


Sjogren syndrome antigen B 


Monocyte chemotactic protein 


GR02/ macrophage inflammai 


Small nuclear ribonucleoprotei 


|GR0I/ macrophage inflammai 


Lymphocyte adhesion molecul 



SUBSTITUTE SHEET (RULE 26) 



WO 02/46465 



PCT/GBOJ/05458 



253 

o 



CO 
CO 



co 




OO 









CO 


co 




-o- 


co 


c^> 


to 


CO 






to 




CO 


^0" 


^a- 


co 


CM 


c— 


to 


tO 




to 


CD 


to 


JO 












CO 


On 




r- 


oo 




Ol 






CD 


CO 




to 


CD 


co 


to 


CM 


ON 


to 


— 




ON 


to 


CM 


CO 




o 




ON 




vo 


to 




vO 


oo 




vo 


' — 






CM 




vo 


CD 


co 


CD 


CD 


r— 




CO 




CO 


oo 


CD 




* 










r— 


r*— 




^ 




r~- 


to 


CO 






oo 




vo 


OO 


OO 


ON 


CD 


r~ 




CD 




CD 


r — 


oo 


CO 




























































> 








<o 




<0 


CO 




CO 


CD 


co 


CD 


CD 




CD 


CD 






CD 


CD 


CD 


CD 


CD 


co 


CD 




CD 


CD 


CD 


es i 




cr 




























CO 






J 






—J 


CQ 




fad 






CQ 






CO 




































2> 




-<£ 










-«fiC. 






as 




CO 










z 








z: 


X 








































co 
























































CO 
CO 




O 
























































o 
































































o- 


2 


CM 






VO 




oo 


CD 






VO 




oo 


CD 




CM 






vo 


OO 


CD 


CM 






vo 


oo 


CD 






o> 




























CM 




CM 


CM 


CM 


CM 


CM 


CO 


co 


co 




CO 


CO 




O- 




CO 
























































cr 




CO 

-a 
























































oo 
u- 




o 




























































o 




























































o 
























































E 




























































cr 




2: 
























































cr 




cr 


cr 














OO 


vO 


On 
















■^0" 






^ 


CD 












o 




"a> 


O 


— 




OO 


vo 




s 


ON 


CO 


On 


— 




OJ 


S 






I — 


r— 


CO 


CD 




r»- 


to 




CM 


vo 


CD 


*00 




oo 
oo 


CM 




on 


CO 




cJ 


vo 




oo 






VO 


CDN 




CO 




-^r 


ON 


t — 


vo 


to 


oo 






vo 




OO 




o 


o> 




to 


VO 




CO 


to 


OO 


^a- 


CO 




oo 


CD 






CD 


CD 




OO 


CM 


to 


ON 




to 


CM 


VO 


CO 






QJ> 


f — • 






*^3- 




— 


to 


CD 


i — 


CD 




CD 


VO 




r— 


VO 


VO 


vo 


CD 


On 


to 


CO 




VO 


On 


CO 


co 
o 




Cu 


CO 

co 


CO 




CO 


r- 






CD 


CD 




CD 




_ ] 


CD 




Pu 


CD 


CO 


<o 


t— 


<c 


CD 


CD 




CD 


<c 


ac 


co 








1 

CU 




*C 








O- 


I 

CU 


•< 


1 

CU 




< 


1 

P- 




<: 


I 

CU 


Cu 


cx. 






1 

CU 


I 

CU 




1 

Cu 


-< 


< 


rein to 












ej 






CQ 




X 


PQ 








2: 








21 


z: 




CQ 


z: 


X 




a; 


ca 




CD 




cr a 






CO 


to 




r~ 


ON 




CO 


to 




l — • 


ON 






CO 


CO 


to 


r— 


On 




CO 




to 


r— 


On 






co 
































CM 


CM 


CM 


CM 


CM 


CM 


CO 


CO 




CO 


CO 


CO 


-o 




CO 


























































c= 
























































referen 




Protei 
































































CO 


CM 


r— 


cr 


VO 


CM 


•«o- 




vo 


CO 


VO 


cr 


cr 


OO 


CO 






CD 


cr 


cr 


r- 


oo 




CM 


cr 


VO 


co 




6 

CO 




on 


o 
oo 


CO 
CO 


CO 




CM 


ON 


oo 

CM 


r»- 
oo 


on 


oJ 




"o> 


oo 


CM 
OO 


CO 


CO 


CD 

to 


*S 


CO 


CM 


CD 
CO 




CM 


CO 


CO 
CM 


CO 








o 


cd 


CD 


o 


CD 


CD 


CD 


CD 


CD 


CO 


CD 


o 


O 


to 


CO 


CD 


CD 


CD 


o 


o 


CD 


CD 




CQ 


o 


CD 








OO 


CO 


CM 
' > 


cx 


PU 

-<d- 
co 


AA 


VV 


CVI 


VV 


OO 


CD 
O 


cx 
ro 


cx 


L0C5 


RO 




— i 


CM 


■_ 
cx 

vo 


cx 

VO 


VV 


CM 
> 




CQ 

CO 


w 

cx 

vo 




•so 




O 






CE 


Pu 
cr 


CO 
CO 


tvl 

PU 

o 


fad 




ti- 
er 




<o 

t — i 


CE 


CM 


CO 

ox> 
c= 


ex. 
c= 


Pu 


Pu 

cr 


Pu 
cr 




•CO 




PU 

cr 




tor 


Pu 


EQ ID N 








FLJ136 


PLA 


al protei 


hypothe 






al protei 




FLJ137( 


CU 


FZP586GI 


e zinc fii 




al protei 


al protei 


al protei 


al protei 


ZP564D 


KIAAI 


1 protein 


al protei 




clear fac 


FZP434I 


o 
cx 

"cO 
CO 














o 










o 








>• 




c_> 


e_> 


o 


o 


uu 
be: 




ca 


CO 




nu 


fad 


CO 


:es 












O 










flS 




a: 




b*rt 


CO 




CO 


CO 


CO 
















Q 










Q 


















o 




O 


ro 




J= 


_cr 


_jrr 


-cr 


o 




jrr 






pa 




o 
cx 


cr 








<o 




o 
cx 










o 
cx 




o 






cx, 




o 
cx 


o 
cx 


O 

cx 


o 
cx 






o 


o 
cx 




CO 




Hy 


o 
















































cx 










seqi 












ac 










rr. 
















EC 


ut: 








5^ 


ac 




o 
cx 

So 






ide 






















































as 






o 




CO 








r— 


cr 


VO 


cm 






vo 






cr 


CM 




CO 






CD 


vo 


vo 


CM 


CO 




CM 


On 


VO 


<o 




e 

CO 








CO 


co 








oo 


r— 


CO 










CM 


CO 


CO 


CD 




r— 


i— 


CD 




CM 


CM 


CO 








VO 




<o 




CM 


ON 


Ol 


oo 




r- 




OO 






to 




CO 


CQ 


CO 








CM 


CJ 








CO 




CO 


O 


o 


CD 


<o 


CO 


CO 






o 


CO 




CD 


CD 


CD 


CD 


Q 






CD 




CQ 




co 


=3 

cr 












ex. 


eu 

^o- 


•< 




Ol 


<: 


o 




cx 






O 






CM 




<: 


CU 


CM 




CQ 


< 




-O 
















co 




< 






o 




w 






CtS 


-J 


13 




vo 
to 




c< 


!— j 




CO 






C 




CO 




Pu 




ti- 


CO 


-o- 














CO 






CU 


Pu 


Pu 


Pu 






PU 




1 — 


Pu 


CO 














o 


Cl 




fad 








oo 












CU 










o 












cr 




er 




tS3 
Pu 










>-» 




cr 






cr 


cr 


cr 


a 


tsi 






cr 




act 


cr 


cr 








a> 




"5 


o 


cr 


cr 




cr 


co 






cr 




CO 




o> 


o> 


PU 


cr 




o> 






CO 










o 




o 


_cr 




CO 




O 


CO 


cr 




o 


CO 




o 


o 


o 


o 




CO 




o 








o 










cx, 




cx 


CX Q 


o 


o 


cx 


o 

l_ 


• o 




cr 


o 




cx 


cx 


cx 


CX 


Q 


o 

u 




V— 

Cl 




leai 


o 


cx 


ex, 


















cx 


cx 




cx 


cr 




isj 


cx 












cr 


cx 








cx 












"co 




"cO 


ac 








CO 




CO 




o 






"co 


CO 


*CO 


"cO 


"5 






"co 




CO 




"co 


*co 








co 




o 






*CO 


~cO 


O 


"cO 






CO 




o 


o 


CO 


o 


"cO 




o 




so 


"co 


CO 




















c_> 


o 




c_> 


CO 




CO 


o 












o 


CO 








cr 


o 




oo 








*eo 




CO 










a> 




_o 








"co 


CO 


CO 




1— 














tt> 


CO 












-cr 






CO 


CO 


"S 








o 






-trr 


-cr 


or: 


o= 


cx 


CO 




JET 




*cO 


CO 




o 






























o. 
























o 






cr 








o 




o 










o 














o 


o 


O 


o 


"co 






o 








O 










ex. 




cx 






o 


o 


cx 


o 






o 


o 




cx 


cx 


CX 


cx 


o 


o 




cx 




CO 


o 


cx 










r»% 




>> 






O- 


cx 




cx 






CO 


cx 




>^ 




r>-. 


>^ 




cx 










o. 


So 










ac 




ac 






so 








CO 




s 


r>-» 












CL> 












>> 


ac 


V 




















DC 




3C 


cx 

CO 




Hei 














_cr 
o 








o 
cx 

SO, 






OO 
OO 












































cx 










rc 






O 




























































cr 




























































r-» 

C4 












cm 


CO 




ON 


OO 


to 


oo 


r— - 






ON 




CO 




CM 




ON 


CM 


to 


vo 




CO 


CM 


vo 












Pu 










PU 




PU 




CM 


Pu 






O 


Q 


Q 


Q 












CM 




Cx3 ■ 








Pu 






PU 




pu 






Pu 






PU 






s 










Q 


O 


Q 










BL 


cm 


lon( 




CX, 




cx 


CX 




cx 


cx 


cx 


cx 


cx 




cx 


cx 




cx 


cx 


cx 


cx 


c_ 


cx 


cx 


cx 




cx 


CX 


cx 


■< 




























































EH 


oo 



























































SUBSTITUTE SHEET (RULE 26) 



WO 02/46465 



PCT/G BO 1/05458 



254 



NMJI823I 


NM 018321 


NM 017917 


NM 016391 


NM 016489 


NM 014665 


NM 016464 


I NM 018384 


AB051455 


AF380995 




AK025902 


NM 005761 


to 
<=> 

_J 


NMJI4584 


NM_0I4584 


BCOI0005 


AK00I903 


980900 WN 


CM 

ON 

ro 
m 

CM 


o 

to 

z. 


• R62339 


r— 

ON 
CO 

<C 
<t 


| R 0660 1 | 






VO 


CO 


to 


uo 


to 


VO 

*o 


CO 

in 


vo 




CM 

vo 


nO 


vo 
vo 


OO 

vo 


CO 
vo 


cz> 


CM 




vo 


CO 


O 
CO 


CM 
CO 


CO 


BAA9I830 


NPJ6070I 


NPJ60387 


Q9Y3CI 


NPJ57573 


NPJ55480I 


CO 

to 
r-~ 

lO 

<z? 
Cu 

a; 


XP 004747 


BAB33338 


AAK5751 8 




None 


NPJ05752 


None 


NPJ55399 


NPJ55399 


XPJ5338 


None 


NPJ06077 


AAD43048 


NP.037542 


None 


None 


None 




CO 


to 


r-- 


Ov 


to 


CO 

to 


to 
to 


r- 
to 


ON 

to 




NO 


CO 
VO 


*o 

VO 


vo 


r — 
vo 


ON 

vo 




CO 

t — 


to 
r- 


r- 


ON 


CO 


CO 
CO 


hypothetical protein FLJI08I5 


Hypothetical protein FLJI 1 100 


hypothetical protein FLJ2064 


Hypothetical protein HSPClll 


hypothetical protein LOC5I251 


K1AA00I4I 


Hypothetical protein H S PC 1 96 


Hypothetical protein FLJI 1296 


Hypothetical protein bA395LI4 


cDNA FLJI 3016 fis, clone 
NT2RP3000624 




cDNA DKFZp586H0324 clone 
DKFZp586H0324 


Clone 23785 


cDNA DKFZp586EI624 


cDNA FL J 1 4 1 62 fis, clone 
NT2RM4002504 


cDNA FLJ14162 fis, clone 
NT2RM4002504 


cDNA DKFZp434EI723 (clone 
DKFZp434EI723) 


cDNA FLJI 1041 fis, clone 
PLACEI004405 


cDNA FLJI0433 fis 
NT2RPI000478 


r— 
O 

co 
cu 

tsi 

-< 
z. 
o 

o 


cDNA FLJ23313 fis, clone 
HEP119I9 


ESTs 


ESTs 


L ESTS 


Hypothetical protein FLJ10815 


Hypothetical protein FLJI 1100 


Hypothetical protein FLI20644 


Hypothetical protein CGI-1 17 | 


Uridine 5' monophosphate hydrolase 1 


Hypothetical protein KIAA0014 


• Hypothetical protein HSPC196 


Hypothetical protein FLJI 1296 


Hypothetical protein KIAAI668 


SECIS binding protein 2 




cDNA:FLJ22249 fis, clone 
HRC02674 


Plexin CI 


cDNA DKFZp586EI624 


EROI (S.cerevisiae)-like 


EROI (S.cerevisiae)-Iike 


Hypothetical protein 
DKFZP434E1723 


cDNA FLJI 1041 fis, clone 
PLACE1004405 


Tubulin, beta, 4 


Insulin induced protein 2 


Adenylate kinase 3 


Novel PI-3-kinase adapter 


CO 


Novel Metallothionein 


o. 




Umd | 




Cu 




O. 


CO 
CU 


vo 
cu 


ex. 




oo 
J-U 

Cu. 


plEl 8 


plE16 


vo 

o 

Cu. 


vo 

Cu* 


CM 
U3 
Cu 


plElO 


vU 
a, 


CD 

s 

C 


pID 13 


o> 
c 


EI 

L C 


i — 

UJ 
u "cu 



SUBSTITUTE SHEET (RULE 26) 



WO 02/46465 



PCT/GBOJ/05458 



255 



NM 022073 


NM 006115 


NM 022051 


AB051532 


AB051 532 


NM 021734 


NM 021626 


AY0I0I12 


NM 017813 


AA52I3I4 


AK056230 


R5I835 


AF0750I8 


oo 

ON 
VO 


AK0241 03 


NO 

CD 

CM 
O 

CD 

t 


NM 000314 


T73780 


NM 005962 


NM_005907 


AA44636I 


AA9314I1 


AK023505 


wo 

CM 
CD 

z. 


AF275725 


AK022894 


A A 25 1 748 | 


VO 

oo 


oo 
oo 


CD 


ON 


cm 
on 


o> 


vo 
ON 


oo 

ON 


cd 
cd 


cm 
cd 


CD 


vo 


oo 
cd 


CD 






vo 


OO 


CD 


CNJ 
CNJ 


CM 


vo 
rM 


oo 

CM 


CD 

CO 


CO 


co 


vo 

CO 


NP.07I356 


901900 dN 


NPJ71334 


VO 
CO 

oo 

cm 
CO 

CO 


BAB2I836 


CD 

oo 

CO 

oo 
vo 
CD 


NPJ67639 


None 


AAK52336 


None 


BAB71I24 


None 


None 


None 


None 


NPJ02052 


NPJ00305 


None 


NPJ05953 


NPJ05898 


None 


None 


None 


BABI5424 


AAK69052 


XP 043394 


None 


OO 


r-- 
oo 


ON 

oo 


on 




CO 
ON 


ON 


r— 

ON 


ON 
ON 




CO 

o 




cd 


ON 

CD 




CO 






ON 


CM 


CO 
CM 


uo 
cm 


r — 

CM 


ON 
CM 


CO 


CO 
CO 


uo 

CO 


ESTs 


ESTs 


ESTs 


ESTs 


ESTs 


ESTsI 


ESTs 


ESTs 


ESTs 


ESTs 


ESTs 


ESTs 


ESTs 


ESTs 


ESTs 


ESTs 


ESTs 


ESTs 


ESTs 


ESTs 


ESTs 


ESTs 


ESTs 


ESTs 


ESTs 


ESTs 


ESTs 


EGL nine (C.elegans) homolog 3 


PRAME 


C!orfl2 


Semaphorin 4b 


Semaphorin 4b 


ON 

<: 

v> 

CM 

CJ 
»— 3 

CO 


Serine carboxypeptidase 1 


Unknown mRNA (schizophrenia- 
linked) 


Myo-inositol monophosphatase A3 


EST 


Hypothetical protein FLJ3I668 


EST 


cDNA YI27F12 


E— 

CO 

U2 


cDNA FLJI404I fis, clone 
HEMBAI005780 


Glutam ate-cysteine ligase, modifier 
subunit 


PTEN 


EST 


M AX -interacting protein 1 


Mannosidase, alpha, class- 1 A, 
member 1 


E— » 

CO 

PQ 


EST 


cDNA FLJ13443 fis, clone 
PLACE1002853 


Hypothetical protein FLJ22622 


CD84-HI 


Hvoothetical orotien FLJ12832 


E— 

CO 
PQ 


vo 
PJ 

cx 


OQ 

CM 

ex. 


Q 
cx 


r- 

O- 


O- 

cx 


CM 

CJ 
cx 


CO 

o 

CX 


ex. 


CD 
CM 

pa 

CX 


<T 
cm 
cx 


r— 
PJ 

Cx 


plEI9 


ex 


0. 


CO 

CM 

PJ . 

CX, 


CM 

PJ 

ex. 


p!D23 


P1D24 


p)D22 


CN» 
W 

cx 


pa 

o 


PJ 

L O 


oo 

s 

CU 


d1D21 


d!C22 


CNJ 

CJ 

L C 


plDIl 



SUBSTITUTE SHEET (RULE 26) 



WO 02/46465 



PCT/GB01/05458 



256 



NM 000104 


AB03295I 


N M _02 1 1 7 5 1 


R68736 1 




NM.006456 


AL050021 


AK002I64 


NMJ32377 


XMJI2933 


AA420992 


NMJ03470 


o 

ON 
CO 
CM 
CO 

z. 


AK023680 


m 

CM 
C3 
CO 
CO 


AA664228 


R44397 


NMJ0I874 


L W87747 


AA973568 


1 T98S29 


AA022679 


H1792I 


co 

CO 

r-. 
cr> 

I 

2: 


W9I958 


R63694 | 


CO 

m 


CO 


CM 






VO 


OO 


O 
vo 


CM 
VO 


vo 


VO 
VO 


CO 
VO 


VO 


cm 
vo 


vo 


vo 
vo 


oo 
vo 




CM 




vo 


oo 


CO 
CO 


CM 
OO 


CO 


vo 
oo 


NP 000095 


XPJ12932 


NPJ66998 


None ( 




NP.006447 


None 


None 


XPJ32794 


AAL14467. 


None 


NPJ0346I 


NPJI5766 


None 


NP 149014 


None 


None 1 


NPJ01865 


None 


None 


None 


None f 


None 


NP.009297 


None 


None 


r-— 


ON 
C*~l 




co 




VO 


t — . 


ON 


vo 


CO 
VO 


VO 
VO 


VO 


ON 
VO 


vo 


CO 
VO 


vo 

vo 


r— 
vo 


ON 

vo 




CO 

t — 


VO 

r- 


r— 
t— 


ON 


oo 


CO 
CO 


vo 

CO 


ESTs 


ESTs 


ESTs 


ESTs | 




cDNA FLJI4028 fis, clone 
HEMBA 1003838 


cDNA DKFZp564DOI6 (clone 
DKFZp564DOI6) 


cDNA FLJI 1302 fis, clone 
PLACE 1009971 


NEDO FLJ10309 fisci 
NT2RM2000287 


Sequence from clone RPl 1-39402 
on ch 20 


ESTs 


ESTs 


ESTs 


ESTs 


ESTs 


ESTs 


ESTs 


ESTs 


ESTs 


ESTs 


ESTs 


ESTs 


ESTs 


ESTs 


ESTs 


EST ESTs 


CYPIBI 


Hypothetical protein KIAA1125 


Hepcidin antimicrobial peptide 


EST | 




Sialyltransferase 


cDNA DKFZp564D016 


cDNA FLJ1 1302 fis, clone 
PLACE1009971 


Hypothetical protein MGC4549 


ELM02 


E— 

CO 

to 


Ubiquitin specific protease 7 


Nucleolar phosphoprotein Nopp34 


cDNA FLJI 36 1 8 fis, clone 
PLACE10I0925 


Hypothetical protein FLJI 351 1 


Ribosomal RNA intergenic spacer 


E— 

CO 

to 


s 

<L> 

CO 

cx 

a> 
o 

>- 

X 
O 
-O 
u. 

C3 


E— • 
CO 

PO 


E— 1 
CO 


E— 

co 

ua 


V EST 


E— 

CO 

pa 


ABL 


E— 1 
co 

to 


ro 
tO 

a 


plD20 


to 

cx 


ON 

Q 
o. 




CM 
CU 


ex. 


ex. 


CM 

ex. 


CO 

ex, 


p 1 110 


plH 1 8 


plH24 


CM 
CM 

to 


p 1 H2 f l 


Ilia 


ol H 14 


olHIl 


olHI7 


D1H12 


r- 

n: 

o 


olH15 


D1H20 


CO 

o 


olH16 


ON 

EE 

CX 



SUBSTITUTE SHEET (RULE 26) 



WO 02/46465 



257 



PCT/GBO 1/05458 



AK024747 


AA909912 


T99032 


H52503 


A A 1 27017 


R38647 


T87233 


NM 018233 


AF075053 


r— 
-cr 
co 

CD 


BC005845 


CD 
CD 


BC00I6I2 


AA679939 


AA630167 




NMJ04331 


NM J06516 


NM J330.16 


NMJ05566 


NM .005566 


NMJ01993 


NM.003376 


NM J06096 


NMJ009I7 


.NMJ04052 


NMJ00602 


NM_000602| 


OO 

oo 


CD 
On 


CM 

ON 


-<=r 
on 


vo 

ON 


CO 
On 


CD 
CD 
CM 


CM 
CD 
CM 


CD 
CO 


VO 
CD 
CO 


OO 
CD 
fN 


CD 
CO 


CO 
CM 


CO 


vo 
CM 




CO 
CM 


CD 
CM 
CM 


CM 
CM 
CM 


CM 
CM 


co 

CM 


vo 

CO 
CM 


CO 
CO 
CM 


CD 
CO 
CM 


CM 
CO 
CM 


CO 
CM 


vo 

CO 
CO 


vo 

CO 

CO 


AAHI4003 


None 


None 


None 


None 


None 


None 


BABI4226 


None 


NPJ78987 1 


AAH05845 


NP.001435 


None 


None j 


None 




NP.004322 


NP 006507 


t — 
CO 

ON 

CO 

1 

CU 

2: 


NPJ05557 


NPJ05557 


NPJ01984 


NPJ03367 


NP 006087 


NPJ00908 


NPJ04043 


NPJ00593 


NP 000593 


CO 


On 
OO 


ON 


co 
o\ 


vo 

ON 


c — • 

ON 


ON 
ON 


CD 
CM 


co 

CD 

CO 


CD 


CD 
CM 


ON 
CD 
CM 


CO 


CO 
CM 


m 
CM 




r— 

CM 


ON 

CO 


CM 
CM 


CO 
CM 
CM 


CO 

CM 
CO 


CM 
CM 


CM 
CM 


On 
CM 
CM 


CO 

CM 


CO 
CO 
CM 


vo 

CO 

CM 


VO 
CO 
CM 


ESTs 


ESTs 


ESTs 


ESTs 


ESTs 


ESTs 


ESTs 


ESTs 


ESTs 


ESTs 1 


ESTs 


ESTs 


ESTs 


ESTs i 


ESTs' 




| BCL2/adenovirus ElB 19kD- 
| interacting protein 3-like 


Solute carrier family 2, member 1 


PDGF beta 


lactate dehydrogenase A. 


lactate dehydrogenase A 


Tissue factor 


Vascular endothelial growth factor 


RTP/NDRGI 


Procollagen-proline 4-hydroxylase 
alpha 1 


BCL2/adenovirus ElB-interacting 
protein 3 


Plasminogen activator inhibitor, 
type I 


Plasminogen activator inhibitor, 


Hypothetical protein FLJ21094 


cn 
UJ 


EST 


E— • 
oo 

to 


f— 

CO 

tU 


CO 

tu 


E— • 

CO 

tu 


Hypothetical protein FLJ10826 


cDNA YO23H03 


Hypothetical protein FU22690 ! 


Mitochondrion sequence 


Fatty acid binding protein 5 


Mitochondrion sequence 


E— 

CO 

W 


E— 

go 
tu 




BCL2/adenovirusEIB19kD- 
interacting protein 3-like 


SLC2AI 


PDGFB 


Lactate dehydrogenase A 


Lactate dehydrogenase A 


Tissue factor 


VEGF 


N-myc downstream regulated 


Proline 4-hydroxylase, alpha 
polypeptide I 


BCL2/adenovirus ElB-interacting 
protein 3 


Plasminogen activator inhibitor, type 

1 


Plasminogen activator inhibitor, type 


plH23 


plHIO 


vO 

o. 


pi H 1 3 


On 

:r 

CL 


CO 
cm 
O 

O- 


plG21 


plHI 


CD 
CM 

O 

Cl 


a: 

CL 


On 
O 

CL 


plH2 


OO 

O 

CL 


^ 

CL 


CO 
CL 








CO 

p- 

CL 


OO 
CL 


ON 

cx 


plB 17 1 


DIO20I 


CM 
CQ 

O. 


CO 

CQ 

CL. 




CO 

CQ 

CX, 


ON 
CQ 
CL 



SUBSTITUTE SHEET (RULE 26) 



WO 02/46465 



258 



PCT/GB01/05458 





vo 

<o 
o 
cd 

IS 
a. 


m 

ON 

m 
cd 

2: 


CD 
to 
*«3- 
C-J 
CD 
CD 

21 


CD 
»n 

ON 

to 

CD 
CD 


CO 

oo 

CO 

1 — ■ 

CO 

•<c 


ro 
ON 
NO 
CD 
CD 


ro 

ON 

vO 
CD 
CD 


CO 
On 
VO 
CD 
CD 


CO 

On 
NO 
CD 
CD 

2: 


ON 
OO 

O 
CD 
O 

21 


On 
OO 

O 
CD 
CD 

21 


o> 

CO 

CD 
CD 
CD 

2: 


NM.000I89 


^3- 
OO 

to 

CD 
CD 
CD 

2: 


-^3- 
OO 
lO 
CD 
CD 
CD 

2: 


-3- 
00 

to 

CD 
CD 
CD 

2: 


vO 
-«3- 
CD 
<N» 
CD 
CD 

\ 

2: 


NM J02046 


ON 

oa 

CD 
CD 
CD 

I 

2: 


oo 

^3" 

CD 
CD 

1 

2: 


to 

NO 

to 

CD 
CD 

I 

2: 


to 
vo 

CO 

<CD 
CD 
CD 

1 

2: 


NM_0I34I0 


CO 

wo 

CTn 

VO 

CD 
CD 

1 

2- 


C"NJ 

oo 
to 
o 

CD 
CD 

J 

21 


CM 

oo 

lO 
CD 
CD 
CD 

2. 


r— 
oo 
CD 
CM 
CD 
CD 

I 

2^ 


I — ' 
oo 
CD 
CM 
CD 
CD 

1 

2: 




CO 
CO 
OJ 


CD 


oj 
^3- 
cm 


^3" 


no 
n 


CO 

*<3~ 
cm 


CO 
CM 


OO 
-<3- 


oo 
-53- 


CD 
vo 
c-J 


CD 
>-o 
c-J 


CD 

to 

OJ 


CD 

to 

OJ 


to 
rvj 


io 

r-i 


OO 

to 

OJ 


to 

CM 


to 


NO 

to 

CO 


oo 
to 

Ol 


CD 

NO 

CM 


CM 
NO 
CM 


-<3- 
vO 
CM 


VO 

vo 
r-o 


oo 
VO 

CNl 


oo 

NO 
CM 


CD 
CM 


CD 
r — 

CM 




cr\ 
CD 
cd 
cd 

1 

Cu 

2: 


CTv 

to 

cd 
cd 

1 

cu 
2- 


•«=r 
-"3- 

CM 

o 

CD 
CU 


-3" 
ON 
WO 
CD 
CD 

1 

Cu 

3: 


CO 

c 
o 
2: 


cm 

NO 
CO 
CO 
CD 
CD 

1 

Cu 
2: 


NO 
OO 
OO 
CD 
CD 

cu 

a: 


C>J 
NO 
CO 
OO 
CD 
CD 

CU 

2: 


r>) 

NO 
OO 

oo 

CD 

I 

2: 


CD 
oo 

CD 
CD 
CD 

cu 

2: 


CD 
OO 

CD 
CD 
CZD 

1 

P- 

21 


CD 
OO 

CD 
CD 
CD 

1 

cu 
21 


CD 
OO 

CD 
CD 
CD 

Cu 

2: 


to 
r^- 
to 
CD 
CD 
O 

Cu 

2; 


to 

to 
CD 
CD 
CD 

( 

Cu 

2: 


to 
r-~ 
to 
CD 
CD 
CD 

CU 

2: 


I — 

CO 

CD 
CO 
CD 
CD 

1 

Cu 

2: 


I — ' 

CO 

CD 

CD 
CD 

1 

Cu 

2: 


CO 

CD 
CD 
CD 

Cu 

2u 


On 
"3" 
CD 

CO 

I 

CU 

2: 


NO 

to 
to 

CD 
CD 

1 

Cu 

2: 


NO 

to 

CO 
CD 
CD 
CD 

1 

Cu 

2: 


CM 
-3" 
to 
r— - 

CO 

CD 
Cu 

2: 


-»3- 
^3" 
CTN 

to 
CD 
CD 

cu 

2; 


CO 

t — 
to 

CD 
CD 
CD 

CU 

21 


CO 

« — 
to 

CD 

CO 

CD 
Cu 

2: 


oo 
I — - 
CD 
CM 
CD 
CD 

1 

Cu 

2: 


oo 
[ — 
CD 
CM 
CD 
CD 

1 

Cu 

2: 




r~ 

CO 

cm 


ON 
CO 
<N 


-<3- 

CM 


ts 


cm 








^3" 


ON 
OJ 


On 
'^3- 
OJ 


On 
OJ 


On 


to 

C^J 


to 

OJ 


to 

CNJ 


CO 

to 

04 


CO 

to 


to 
to 

CX 


to 

CM 


On 
tO 
CM 


NO 
CM 


CO 
NO 

CM 


to 
vo 
CM 


r— 

NO 

CNl 


r— 

vo 
CM 


On 
NO 
CM 


o\ 

NO 
CM 


<o 

Q. 
>-> 


o 

co 
CO 
C= 

co 

OJ0 

>s 

X 

o 
o 
o 

CJ 


DC 

CZ 
CO 
O 

o 

"co 


—l 

"o 
cz 
o 

o 
*co 


O 
c= 

CO 

cz 
o 

o 
"co 


CO 

o 

o 
cz 

9 

.£ 
*S 
es 
o 

o 

iS 
a> 


CO 

co 
_o 

E 
E 
c-o 
Zo 

i 

ca 
i — * 

CO 

CO 
CO 
CO 
S3 
O 
CO 


CO 

CO 
-0 

E 
E 
cm 
>-» 

B 

CO 
1 — . 

co 

1 — 

o 
co 

S3 
O 

co 


CO 

CD 
JO 

a 

E 

oT 

>-. 

i 

«-»_. 
a> 

C3 

o 
a> 

o 

CO 


CO 
O 

_o 
E 

£ 
*E 

cd 

<U 

C3 
O 

w 

S3 

O 
CO 


CO 

to 
ca 
cz 

"o 

X 

a> 


Ol 

CO 
CO 

"o 

X 
CO 

zt: 


CN) 
CO 
CO 

cz 
o 

X 

CO 

Pu 


CO 
CO 
CO 

cz 
o 

X 

CO 

DC 


OO 

• c= 

C3 

CO 

CO 

cz 


OO 

c= 

C3 
CO 

CO 

(= 


OO 
CZ 

jm 

CO 

CO 

cz 


<L> CO 
CO co 

-cz cz 
O- <o 

co oj; 
° O 

cA ^ 

CO ^ 

CO 
TS3 

CO 

CO 

CO 

>-» 

o 


CO CO 
CO CO 

-cz cr 

CU CO 

«^ ox 

^ s 

o- -o 

CO « 
-o ^ 

-cz 

cO 

-o 

CO 
w> 

CO 

z»-» 

o 


CO 
CO 
CO 

cz 

CO 
CO 

I— 

CO 

o 

O 
o. 

CO 

O 
Cu 


CO 
CO 
CO 

O 
CZ 

W 


CO 

-cz O 

s S 

co 

_o 

CO 
co 
O 

CO 
S3 

CJ 

CO 

CO 
CO 

o 
-o 
'co 


CU 

CO 
co 

CO 
CO 

E 

O 

CO 
CO 

CO 
-CZ 
CU 
to 
O 

_CZ 

CO 
co 
O 

E— 1 


CO 

CO 

CO 
co 

CO 
CZ 

CO 

co 

>N 

CZ 

CO 

-o 
<C 


CO 

CM 

cz 

CO 

cz 
o 
lo 
o 

"eO 
CO 


cs 

cz 
o 
o. 

o 

CO 

co 

o 


■cz 

CZ 

o 
o_ 

o 

CO 
CO 

o 


cz 

S3 

cz 

CO 

CD 


cz 

S3 
CZ 
CO 

o 




>< 

o 


dc 

cz 

CO 
CZ 

o 
o 
CO 


-a 

cz 

*o 
cz 

.2 

JC 

o 

' CD 


o 

co 
cz 
o 

o 

"c3 


"co 
cs 
o 

o 
cs 

*2 

W 

_CZ 

"S 
o 

-CZ 

_© 

co 


CO 

-< 

CM 

CJ 
—3 
CO 


CO 

CJ 
CO 


CO 

<: 

O 
co 


CO 

►—3 

co 


t-sl 
CO 

ca 
O 

o 
DC 


CO 

oo 

CO 

CZ 

o 

CO 


<N 

CO 

CO 
CO 

cz 
o 

CO 

DC 


CNl 

CO 
CO 
CO 

c= 
o 

X 
CO 

DC 


OO 

cz 

=3 
CO 

CO 


OO 

cz 

=3 
CO 

CO 


oo 
cz 

^£ 
C3 
CO 

0> 

cr 


DC • 

Q 

Cu 

o 


DC 
O 

Cu 

o 


CO 

CO 
CO 

cz 

CO 

CO 

CO 
CO 

>~ 
toi 

O 
JZ 
CZi. 
to 

o 
^cz 
Cu 


CO 

CO 
CO 

O 
CZ 

PJ 


CJ 
CO 

CO 

CO 

o 


CO 
CO 

CO 
k_ 

CO 

E 
o 

CO 

CO 

O 

-CZ 
CL 

CO 

O 

E— • 


CO 

CO 
CO 
CO 

cz 

OJ 
CO 

s>- 
cz 
to 
-o 


< 

CM 

cz 

*CO 

cz 
o 

o 
"co 

CO 


cz 

cz 
o 
O- 
o 
o> 

CO 

o 


cz 

cz 

o 

CL 

o 

CO 
CO 

o 


cz 
cz 

2 

o 


cz 

"so 
cz 

CO 
t« 

o 




r— 

3: 

CU 


«<3- 
cm 

<c 

o. 




CO 

o. 






o. 


CO 

o. 


'"3" 

<: 


to 


NO 
O- 


r— 
o. 


oo 

O- 


•t3- 
CQ 
o. 


to 
CO 

o- 


NO 

CQ 


Cu 


-< 

Cu 


CO 

<c 


-«3* 
< 


ON 

. o- 


CD 
CM 

<c 

o 


CM 
CM 

< 


< 

L C 


CD 
CM 
CQ 

L O 


CM 

CQ 
- o 


CJ 
t c 


oo 

o 

L O. 



WO 02/46465 



PCT/GBOJ/05458 



259 



OJ 

co 
cO 
cO 

CD 

f 

z 


NMJ0I975 


ro 
CD 

OJ 
CD 
CD 


o- 

CM 

vo 
CD 

CD 

1 

2. 


NM .005962 


BC008022 


NM 004000 


NM_001 186 


NM 002633 


AFI5I867 


NMJ03864 


NMJ05502 


A J 1 3 1 244 


MI7783 


NMJ0I365 


NMJ05079 


NMJJ04064 


NM_0062I9 


AF027153 


00 

OO 
OJ 

^ 

CD 
CD 

Z. 


NMJ03039 


ON 

ro 
CD 
ro 
CD 
CD 


OJ 
OJ 

CD 
CD 


NMJ01122 


NMJ0II22 


OJ 
OJ 

CD 
CD 


t 

Ol 

CD 
CD 

2: 


ro 
t — 

CD 
CD 

2: 


[ NM_0I6224| 


o> 

1 — ■ 
oJ 


oJ 


r— 

OJ 


00 
r— 

OJ 


cd 
00 

Ol 


CO 
OJ 


00 
OJ 


vo 
OO 
o» 


OO 
OO 
Ol 


CD 
On 
OJ 


OJ 

ON 
Ol 


ON 

01 


vo 

ON 

OJ 


CO 
ON 
OJ 


CD 
CD 
ro 


OJ 
CD 
co 


CD 
CO 


VO 
CD 
CO 


00 
CD 
CO 


CD 
CO 


OJ 
ro 


OJ 
rO 


rO 


ro 


ro 


co 


VO 
CO 


CO 
CO 


CD 
OJ 

CO 


vO 
-d" 
r— 
ro 
O 

CU 

2: 


NP 001966 


NP_002094 


NPJ016I8 


NPJ05953 


None 


NPJ0399I 


NPJ01177 


NPJ02624 


Q9Y3B3 


NPJ03855 


NPJ05493 


CAA10334 


AAA35883 


NPJ01356 


NPJ05070 


NP.004055 


NPJ06210 


AAC39548 


NP_004279 


NP_003030 


NPJ03030 


NP_001 1 13 


NP_001113 


NP 001113 


NPJ01113 


NP 004418 


NP 001722 


NPJ57308 


c— 

0> 


ro 
o- 
oj 


10 
r— 

04 


o- 
r- 

Ol 


ON 

o- 

Ol 


00 

OJ 


ro 
00 
Ol 


to 

00 
Ol 


DO 

oj 


ON 
OO 
Ol 


ON 

OJ 


CO 
ON 
OJ 


to 

ON 
OJ 


O- 

ON 

OJ 


ON 
ON 

OJ 


CD 

CO 


CO 
CD 
ro 


to 
CD 

CO 


r— 
CD 
CO 


ON 

CD 
CO 


CO 


ro 


CO 
CO 


ro 

CO 


CO 
CO 


CO 
CO 


to 

CO 


CO 


ON 
CO 


Hypoxia-inducible protein 2 


Enolase 2, (gamma, neuronal) 


Glycogen synthase 1 (muscle) 


Activated leucocyte cell adhesion 
molecule 


M AX-interacting protein 1 I 


Nuclear receptor co-repressor 


Chitinase 3 -like 2 


BACHl transcription factor 


Phosphoglucomutase 1 


. CG1-I09 protein 


SAP30 


ATP-binding cassette transporter- 1 


SEC24 protein 


Trinucleotide repeat containing 3 


Postsynaptic density protein 95 


Tumor protein D52 


Cyclin-dependent kinase inhibitor 
p 2 7 k i p. 1 


phosphoinositide-3-kinase, 
catalytic, beta 


Solute carrier family 5, member 3 


PSCDBP 


Solute carrier family 2, member 5 


Solute carrier family 2, member 5 


Adipophilin 


Adipophilin 


Adipophilin 


Adipophilin 


Early development regulator 2 


B-cell translocation gene 1, 


SH3PXI 


Hypoxia-inducible protein 2 


Enolase 2 


Glycogen synthase 1 


ALCAM 


M AX-interacting protein 1 


E— 

GO 

uj 


CO 
<D 

CO 
C3 


BACHl 


Phosphoglucomutase 1 


Hypothetical protein LOC510I4 


Sin3-associated polypeptide 


ABCAI 


SEC24 member A 1 


Glia-derived nexin i 


Postsynaptic density-95 


Tumor protein D52 


t — 

OJ 

cu 


PI-3-kinase, catalytic, beta 
polypeptide 


SLC5A3 


Cvtohesin binding protein 


f SLC2A5 


SLC2A5 


■ Adipophilin 


Adipophilin. 


Adipophilin 


Adipophilin 


Earlv development regulator 2 


B-cell translocation gene 1 


Sorting nexin 9 


00 

Q 

cx 


OlVld 


oJ 
cx 


ro 

OJ 

O 
cu 


to 

0 

cx 


r— 
O 
0. 


CO 
OJ 

ex. 


0 

cx 


to 

0 

cx 


ro 
OJ 

fcu 

O. 


00 
0 

cx 


CO 

0 

0. 


plGIO 


OJ 
PU 

cx 


OJ 

0 

ex. 


plGII 


vo 

0 

Ou 


ON 
O 
O, 


0 
0 


nIG14 


to 

*C 
, ... 

0 

L 


vc 

-< 

- c 


vo 
CQ 

l cb 


t— 

L "C 


OO 

pa 

L C 


o> 
pa 

L O 


r- 
C 

L — 

c 


5E 

c 


plF22 



SUBSTITUTE SHEET (RULE 26) 



WO 02/46465 



260 



PCT/G BO 1/05458 



NM 004354 


ALI37430| 


NMJ00I04 


NM .004926 1 


WO 

OO 

n 

OJ 

cd 

i 


NM.003467 


r- 
vo 
^l- . 

CO 
CD 
CD 

I 

Z. 


CO 

I — 
CD 

1 

X 


NM.006813 


AF078844 


UO 

oo 
o- 

CD 
CD 
CD 

1 


NMJ03028 


AK0234I8 


CD 
vo 
OO 

CD 
CD 


NM.006759 


ON 
ON 

CD 
CD 

t 

Z. 


ON 

ON > 

CD 
CD 

\ 

Z. 


ro 

<o 

CD 
no 
CD 
O 

\ 


ro 
O 
CD 
uo 
CD 
CD 

z. 


ro 

<G 
CD 

WO 

CD 
CD 

I 

;> 
z. 


<o 

ro 
CD 
CD 

1 

:> 


so 
r- 

OJ 

CD 
CD 

I 

Z. 


oi 

CO 


OJ 

CO 


SO 

CO 


oo 

OJ 
CO 


CD 
CO 
CO 


OJ 
CO 
CO 


OJ 
CO 
CO 


-<d- 
co 

CO 


vo 
CO 
CO 


oo 

CO 
CO 


CD 
co 


OJ 
CO 


co 


VO 
co 


OO 

CO 


CD 
WO 
CO 


CD 

to 

CO 


oi 
»o 

CO 


OJ 

wo 

CO 


OJ 
wo 
CO 


to 

CO 


VO 
WO 
CO 


NPJ04345 


XPJ49657 


NPJ00095 


NPJ04917 


WO 
VO 
CO 
CD 

1 

pu 


NPJ03458 


NPJ03458 


XP.OI7I3I 


NP.006804 


T08745 


NP.000776 


NPJ030I9 


NPJ6I130 


NP.00185I 


NP.006750 


NP.004190 


NP.004190 


NPJ05054 


NP 005054 


NP 005054 


NP 003637 


! NPJ02766' 


CO 


CO 
OJ 
CO 


w~> 
OJ 
CO 


t — - 

OJ 
CO 


on 

OJ 

CO 


CO 
CO 


CO 
CO 


CO 
CO 
CO 


WO 
CO 
CO 


t — 
co 

CO 


C\ 
CO 
CO 


>dj- 
CO 


CO . 
CO 


wo 
co 


o- 

CO 


ON 
CO 


ON 
CO 


wo 

CO 


wo 
CO 


wo 
CO 


CO 

to 

CO 


wo 
to 

CO 


Cyclin G2 


NAG-5 protein 


Cytochrome P450 1B1 (dioxin- 
inducible) 


Butyrate response factor 1 


p8 protein (candidate of metastasis | 


chemokine (C-X-C motif), receptor 
4 (CXCR4) 


chemokine (C-X-C motif), receptor 
4 (CXCR4) 


solute carrier family 16, member 6 


Proline-rich protein with nuclear 
targeting signal (B4-2) 


RNA helicase-related protein 


Cytochrome P450, subfamily 
XXVIIB, polypeptide 1 


SHB adaptor protein 


Papillomavirus regulatory factor 
(PRF-1) 


SLC31 A2/ hCTRI 


UDP-glucose pyrophosphorylase 2 
(UGP2) 


Proline 4-hydroxylase, alpha 
polypeptide II 




Stearoyl-CoA desaturase 


Stearoyl-CoA desaturase 


Stearoyl-CoA desaturase 


Diacylglycerol kinase, zeta 


Serine protease 1 1 


Cyclin G2 


Hypothetical protein LOC51754 


CYPIBI 


Butyrate response factor 1 


P8 protein (candidate of metastasis 1) 


CXCR4 


CXCR4 


Hypothetical protein XPJI7131 


Proline-rich protein with nuclear 
targeting signal 


Hypothetical protein hqp0376 | 


CYPl 


SHB adaptor protein 


Papillomavirus regulatory factor PRF- 

1 


SLC3IA2 


UDP-glucose pyrophosphorylase 2 


Proline 4-hydroxylase, alpha 
polypeptide II 


Proline 4-hydroxylase, alpha 
polypeptide II 


Stearovl-CoA desaturase 


I Stearovl-CoA desaturase 


Stearovl-CoA desaturase 


DiacvIelYcerol kinase, zeta 


Protease, serine, 1 1 


OJ 

O 

o. 


1 plFll 


vo 

Ou 


1 PlFUl 


r— 

iZ 

o- 


o 


OJ 

o. 


CO 

IX 

CL 


plF20 


plF6 


CU 


plFI5 


CO 

lZ 

Ou 


r— 

<c 

o 


OJ 

<C 
cu 


CO 


wo 
CQ 

cu 


o 
cc 

c 


«— 

1 cc 

L C 


OJ 

cc 

X. c 


cc 

u C 


plB22 



SUBSTITUTE SHEET (RULE 26) 



WO 02/46465 



PCT/GB01/05458 



261 



NMJ100577 


NM 006469 


NMJ0II05 


NM.014489 




NM 006499 


NM 000175 


NM 006023 


NM 003670 


NMJ16530 


NM.002922 


AB0090I0 


NMJ02205 


L07872 


NM JI6222 


OS 
04 

OJ 

CD 
CD 

1 


NMJ07021 


NMJ00405 


NMJ04779 




AA430382 


NM 002982 


NM 001955 


AA633656 


NM 000700 


NMJ02467 


NM_0000I4 


XMJ08449] 


OO 

vo 
co 


CD 
VO 
CO 


o4 

VO 
CO 


vo 

CO 




vo 
VO 
CO 


OO 
VO 
CO 


CD 

r— 

CO 


r— 

CO 


r— 

CO 


VO 
1 — 
co 


OO 

r- 

CO 


O 
CO 
CO 


OJ 
OO 
CO 


-<a- 

©O 
CO 


VO 
OO 
CO 


OO 
OO 
CO 


CD 
Os 

CO 


OJ 
ON 
CO 




ON 
CO 


VO 
Os 

CO 


OO 
Os 
CO 


CD 
CD 


OJ 

CD 


CD 

«a> 


VO 
CD 


OO 

CD 


OO 
vO 
vo 
CD 

cd 

CD 

1 

PL. 

•z 


NPJ06460 


NP.001096 


NPJ55304 




NPJ06490 


NPJ00166 


NP.006014 


NP.003661 


NPJ57614 


NPJ02913 


BAA23632 


NPJ02196 


AAA60258 


NPJ57306 


NP 002120 


OJ 
VO 
Os 

OO 
CD 

CD j 

\ 

ex. 

Z. 


NPJ00396 


NPJ04770 




None 


NPJ02973 


NPJ01946 


None 


NPJ0069I 


oo 

VO 

OJ 
CD 
CD 

CU 
Z. 


NPJ00005 


XPJ08449 


r- 
m 

CO 


ON 

vo 

CO 


vO 
CO 


CO 
VO 
CO 




VO 

VO 

CO 


0- 
vo 

CO 


ON 
VO 
CO 


o- 

CO 


CO 

r- 
co 


VO 

r- 

CO 


O- 
c — 

CO 


OS 
o- 

CO 


OO 
CO 


CO 
OO 
CO 


vo 

OO 
CO 


r— 

OO 
CO 


ON 
OO 
CO 


ON 
CO 




CO 

Os 
CO 


vo 

ON 
CO 


o- 

ON 
CO 


ON 
ON 

CO 


CD 


CO 

CD 


VO 
CD 


I — 
CD 


IL-1 receptor antagonist, 
alternatively spliced forms 


CO 
O 

cx 
tuc 
cz 
-o 

lo 

CO 

2: 


Activin A receptor type I 


FGF receptor activating protein 1 
(FRAGI) 




Galectin-8 


Glucose 6-phosphate isomerase 


D 1 23 protein 


Decl. 


-O 
OO 

_o 
p£ 


BL34 


Polyubiquitin UbC 


Integrin alpha 5 


Jk-recombination signal binding 
protein 


DEAD-box protein abstrakt 


High mobility group 2 protein 


Decidual protein induced by 
progesterone 


GM2 ganglioside activator protein. 


1 CCR4 associated factor 1 (CAFl) 




1 Nucleoside phosphorylase 


Monocyte chemotactic protein 1 


Endothelin 1 


Heat shock 70kD protein 4 


Annexin AI 


p67 myc protein 


o 
_o 

O 

CU] 

o 

1 — 

o 
ca 

E 

OJ 
ca 
cx 


Macrophage inflammatory protein 


j Interleukin 1 receptor antagonist 


NS1 -binding protein | 


Activin A receptor, type I 


FGF receptor activating protein 1 




Galectin 8 


Glucose phosphate isomerase 


DI23 


DEC-1 


.5 

o 
cx 

-O 
GO 

OO 

<C 

os: 


Regulator of G-protein signalling I 


Polyubiquitin 


Integrin, alpha 5 


Jk-recombinatidn signal binding 
protein 


Abstrakt 


High-mobility group protein 2 


Decidual protein induced by 
progesterone 


GM2 sanelioside activator protein 


CN0T8 




Similar to Nucleoside phosphorylase 


SCYA2 


| Endothelin 1 


Similar to Heat shock 70kD protein 4 


1 Lipocortin I 


MYC 


"» 
JO 
O 

o 

\ — « 

o 
ca 

E 

Ol 

ca 
-e: 
cx 

<: 


SCYA4 


CO 

oJ 
CQ . 


oJ 
CO 

CL 


plC3 


O 

CL, 




VO 
OU 


vO 
CJ 

cx 


pIC7 


OO 
CL 


os 
cx 


CD 

o 

cx 


o 

cx 


OJ 

cx 


plCI3 


cx 


VO 

G 

cx 


VO 

o 

cx 


Os 

o 


D1C20 






VO 

Ph 
o 


CO 
OJ 

-J 

L OJ 

o 




OS 

fad 

O 


olK23 


DIK15 


OO 

cx 



SUBSTITUTE SHEET (RULE 26) 



WO 02/46465 



262 



PCT/G BO 1/05458 





NM 001040 


j NM.0O2940 


NM 001762 


M37435 


NM 006360 


NM 004778 


*/-> 
to 
«n 
wo 

CD 
CD 

1 


! NM 005475 


NM 002515 


NM _003580 


NM.005729 


NMJ02664 


NMJ2I20I 


NMJ2120I 


NM.00100I 


NMJ03042 


MD 
CO 
CM 

r- 

CD 
CD 

1 


NMJ2I238 


NMJ06022 


AA988II0 


NM 002964 


NM.0029I3 


NMJ03I35 




cd 

-<d- 


cm 


-<d- 


VO 

*<d- 


OO 
"ST 


CD 
CM 


CM 
CO 

*<d- 


CM 
«d" 


VO 
CM 


CO 
CM 


CD 
CO 
"d" 


CM 
CO 


^d- 
co 
•*d- 


co 
->d- 


vO 
CO 

^d- 


CO 

CO 

^d- 


CD 


CM 


-<d- 


VO 

^d- 
-<d- 


CO 

'■d- 
"d- 


CD 
I/O 

^d- 


CM 
WO 

-d- 




• NPJ0103I 


NP_002931 


m 
*/o 
I — 

CD 
CD 

1 

CU 

z: 


AAA52I17 


NPJ06351 


NP 004769 


NPJ05546 


I NP 005466 


NPJ02506 


NPJ03571 


NP.005720 


»n 
wo 

VO 
CM 
CD 
CD 

1 

CU 


NPJ67024 


NP_067024 


NP.000992 


NP.003033 


NPJ09217 


NPJ6706I 


NPJ060I3 


None 


NP_002955 


NPJ02904 


NPJ03126 

i 




ON 
CD 

-^r 


■*d" 


m 

^dh 


wo 
*«d- 




On 
-d- 


CM 

^d" 


CO 
CM 


CM 
"d" 


CM 


ON 

CM 


CO 

-d- 


CO 
CO 

-d-' 


CO 

CO. 

^d- 


V/O 
CO 


t — • 

CO 

^d- 


On 
co 
-«d* 




CO 

-<d- 


wo 
^d- 
-<d- 


t — 

-<d- 
-d- 


ON 

^d- 


wo 
•*d- 




Sex hormone-binding globulin 


ATP-binding cassette, sub-family E 
(OABP), member 1 


Chaperonin / Tcp zeta 1 


Colony stimulating factor 1 
(macrophage) 


Dendritic cell protein (GAI7) 


G protein-coupled receptor 44 


Keratin 6A 


lymphocyte adaptor protein 


Neuro-oncological ventral antigen 1 


N-SMase /FAN 


Peptidylprolyl isomerase F 
(cyclophilin F) 


PLECKSTRIN 


High affinity immunoglobulin 
epsilon receptor beta 


High affinity immunoglobulin 
epsilon receptor beta 


Ribosomal protein L44 1 


Solute carrier family 6 Nol 


Synaptopodin ' 


TERA protein i 


TGF beta-stimulated protein TSC- 
22 


Tubulin, beta, 2 


Calgranulin A 


Repiication factor C (145 KDa) 


Signal recognition particle 19 kD 
protein 




Sex hormone-binding globulin | 


ATP-binding cassette El 


CCT6A 


Colony-stimulating factorl 


GA17 


GPR44 


Keratin 6B I 


f Ly m phocyte adaptor protein 


Neuro-oncological ventral antigen 1 1 


Neutral sphingomyelinase (N-SMase) 
activation associated factor | 


Cyclophilin F 


Pleckstrin 1 


CFFM4 


CFFM4 


Ribosomal protein L36a 


SLC6A1 


Synaptopodin 1 


TERA protein) 


TSC-22 


CO 
1X3 


Calgranulin A 


1 Replication factor C large subunit 


Signal recognition particle 19kD 




CM 

cx 


cu 


vO 


OO 
Cu 


pINI 


CM 

CM 

cx 


*<d- 

Cl 


• P'K13, 


pi J20 


plJ22 


CL. 


CO 
Cl 


ON 
V- — > 

Cu 


CM 
CU 


VO 
O- 


Cl 


oo 
Ci 


wo 
>— » 


-«d- 
Cu 


p2A14, 


CO 
CM 

CX 


plJ21 


^d- 

CM 
»— > 

cu 



SUBSTITUTE SHEET (RULE 26) 



WO 02/46465 



PCT/GBO 1/05458 



263 



AK026672 


ON 

oo 
r- 

04 
CO 
CO 


NM_005461 


NMJ06I41 


NMJ16587 


NM.006273 


OM 
OO 

CM 
CO 

1 


NMJ04708 


X52I49 


NM.002569 


CM 

oo 
to 

CO 


NM.0I42I0 


NMJ03I42 


NMJ05623 


ON 
CO 

CO 

CM 

CO 
CO 

1 


NM .006938 


to 

CO 
CO 

1 


to 
uo 
VO 
CO 
CO 
CO 

1 

IS 


to 


vo 
to 

*<3- 


oo 


<o 

vo 
-<=*- 


OO 

vo 


VO 


vO 
VO 


oo 

VO 


<o 
r— 


OO 

r- 
-^r 


^i- 
r— - 


vo 
r— 


oo 
r— 


CO 
CO 


oo 

CO 


oo* 


vo 
oo 


oo 
oo 


None 


co 

oo 

r— - 

04 
CO 
CO 

Pu 

2: 


NPJ05452 


NPJ06I32 


NPJ57671 


NP 006264 


t/~> 

ON 

vo 
vo 

CO 

1 

Pu 

z. 


NPJ04699 


CAA36397 


CO 
\o 
<-/^> 
OO 
CO 

<o 

Pu 


co 

CM 

CO 

CO 

J 

p- 

au 


«n 

oo 

CO 

to 
to 
CO 

cu 

z; 


NPJ03133 


NPJ05614 


NPJ02080 


NPJ08869 


NP_00!502 


NP 000646 


CO 


m 


r — 


ON 

to 




CO 

vo 
-d- 


*/o 

vo 


r- 
vo 


ON 

vo 


r— • 


CO 


to 

t — . 


o- 
o- 


ON 

r— 


oo 


CO 

oo 


vo 
oo 


r— 
oo 


Transcription factor SUPT3H 


Proteasome component C9 


Maf-related leucine zipper homolog | 


dynein, cytoplasmic, light 
intermediate polypeptide 2 


Heterochromatin-like protein 1 


Monocyte chemotactic protein 3 


Fatty-acid-Coenzyme A ligase, 
long-chain 2 


Programmed cell death 5 / TFAR19 1 


Small inducible cytokine A3 


Cytochrome c oxidase subunit Vic 


o 
o. 

to£ 

c= 

-o 
c 

_o 

o 
c= 

o 
Pu 

CO 
< 

Z. 


Ecotropic viral integration site 2A 


Sjogren syndrome antigen B 
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Sequence listing 

1 

MLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDI 
5 VSYTTQAGEKMYFRKFFKFQVLKPLDVKT^ 

ECVSTFGSRAYLQPMDTRQYLYCLKPKNEFAEKAGIIKGVW^ 

TVNLEEPFH ITCKITNCSERTMDLVLEMCNTNS I HWCGI SGRQLGKLHPSSSLCLALTLLSSVQGLQS ISGLRLTDTFLKRTYE 
YDDIAQVCWSSAIKVES 

2 

10 AAAMGTGCCGGTCAAMTGGMGTGMTCCCCCTAMCAGGAGCACCTGCTGGCGCT^^ 

CTTTATTCACCMTATCCCAGTMCATGTGMGAGAMGACTTACCTGGAGATC 

CMCCGTTAATGGTGCAGMGTTTTMTGTTG 

CCTTTTCCAGTTATATCAGCGTTCATMTGATAGCMTCMGTTC 

CTCAGCGTTTAMTCTTTCAGCCTCCMTGCTGCAGTGGCTGMCTTAMCCGGAT 
15 MGTCAMGAMTTGGMCACACATCTTGGTATGTGCTGTGAGTTATACMCTCAGGCTGGAGA 

TCTTCAMTTTCAGGTTCTCAMCCATTGGATGTGAAMCCAMTTTTACMTGC 

MGTATTTCTGGAAGCCCAGATTCAGMTATGACAACCTCACCTATC^ 

ACMTGTMCAGMTTAMTTCAGTCAGCCMGCTGGAGMTGTC^^ 

ATACACGCCAGTACTTATACTGCCTAMGCCAMGAATGM 
20 GAAMTTGGATATAGTATGGAAAACAAATCTAGGTGAMGG 

ATGGAGATGTTAGGOTCTCTTTGGAGGCMTACCAGATArc^ 

ACTGCAGTGAMGGACTATGGATCTGGTTTTGGAAATGTGCMTACCMTTCCATCCACTGGTGTC 
TGGGAMGCTGCATCCMGTTCTTCGCTCTGTCTTGCCCTTACTCTGCTTTCTTCAGTACAGGGACTGCAMGCATC 
TAAGACTAACAGACACATTCTTAMGAGMCATATGMTATGATGACATCGCACMGTC 
25 TGGAMGCTGMGGAMCTTCCMTGTTAGGCTTTTCATTTAGTTTCACAGMC1X3CTCTTTTTG 
ACGTCMCAACGMGTAMTATGTTACTTAMTTTTCTTTCCTGTAAMTGGTTA 
TGATTTTATCTTGTAATTTATATTTGAMTC 
CCATTGGACTTTGTTCTCCAAAAGCTGTGTATCTGAGACCATTTG 

TTATGTMGGTACMGATACTAMTATTTTAGGCTTTGCAGGCCGTMGGTTTCTGTCACAAATACrc 
30 TTCMGAGCAGCCAGTAGGCAAAAAMGTACAGTGGTGTACACAAAMGMGCAGTTGGACTCGGGAAGCTGAGGCAGGAGAAT 

CACTTGAACCGGGGAGGCAGAGGTTGCAGTTMGCCGAGATCATGCTACTGCACTCTAGCCTGAGCGACAGAGTGAGACTCTGT 

CTCAAAAAAAAMGAGGCAGCTGGGCAGATTTGGCCTGCAGGGTATAGTO^ 

TAGMCTTTGTMTATTTTGGTTAGAMTATTTCAGTTMCTCCAGTTTTTTC 

TAGMCAAAGGGGGTAGTCCTCTTATMCCTGATATTTGTCAACACAGTATMGC 
35 GGTTTGTGTCTGTATTTGTATATTATACCTACAMCATACATATCTTTCAGCTACTG 

AGGTTGTTGATTTTTTTTMTTMCTA™ 

CTCATTGATTMGCCACTTATTTTTATMGGT^ 

TTACTGTTACAAMCTTTCTCTCTCCATGTMTCACACTTAGTTATGAGCAMGCAGTGAGAAAGTO 
TGCAGCATTAATGTMCAAAMGTTTCAGTATTTGTCAGGAAMCAAM^ 
40 TMGTTGAMTTCMGMTGTATATATMTTTCTTA^ 

TCATOTTTATTGCCATMCAMTTGTTTGGTCCAAMTGAMGCTATATTCAGM 
TAATGAAATGTCTGTACAAAATAAAGTGCAAGCAGTGT 

3 

MQMRRVWRTCRSGPRKRREARYKKSTEDIFPAQLLKLQRHERWQQEPPVRDHRSWGGSGAGGVA^ 
45 GEGYFMSEDEIiACSPYIPIX5GDFGGGDFGGGDFGGGDFGGGGSFGGHCLDYCESPTAHCNVL^ 
GI^PTLEI^PSLIVKWRRRLAEKRIGVRDWLNGSM^ 
GVNKEKITPLTLKEAYVQKl^VCNDSDRW 
HraiGESVYGDFQEAFDHLCNKIIATRNPEEIRGGGLLKYC^ 
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LQMF VGLEDRKYE YLMTLHGVVNESTVCLMGHERRQTLNL ITHLAI RVU^DQNVI PNVANVTC Y YQPAPYVADANFSNYY IAQ 
VQPVFTCQQQTY STWLPCN 

4 

ACMCTGCTAMGCTCCAGAGACACGAGCGTGTGTGGCAGCMGAGCCGCCAGTTCGGGACCACCGCAGCTC 
5 CGCAGGAGGGGTCGCGGGGCGGGAGTGGTGAGCGCAGGC^ 

CGCTCGCAGCTCCTGGMGTGTTGCCGCCTCTCGGTTTCGCTCTCGCTCGCTGCGC 

TGACCAGGGCCAAGTGGCGCTCGGCGGGCACTACATGGCGGAGGGTGMGGGTACTTCGCCATGTCTGAGGACGAGCTGGCCTC 

CAGCCCCTACATCCCCCTAGGCGGCGACTTCGGCGGCGGCGAC 

CGACTTCGGCGGTGGCGGCAGCTTCGGTGGGCATTGCTTGGACTATTGCGAMGCCC 
10 GGAGCAAGTGCAGCGGCTGGACGGCATCCTGAGTGAGACCATTCCGATTCACGGGCGCGGCAACTTCCCCACGCTC 

GCCGAGCCTGATCGTGMGGTGGTGCGGCGGCGCCTGGCCGAGMGCGCATTGGCGTCCGCGACGTO 

AGCCAGCCATGTCCTGCACCAGGACAGCGGCCTGGGCTACAAGGACCTGGACCTCATCTTCTGCGCCGACCTGCGCGGG^ 

GGAGTTTCAGACTGTGMGGACGTCGTGCTGGACTGCCTGTTGGACTTCTTACCCGAGGGG^ 

ACTCACGCTCMGGMGCTTATGTGCAGAAMTGGTTAMGTG 
1 5 CMCAGTGGCAAAMTGTGGMCTGAMTTTGTGGATTCCCTCC 

ATTAGACTCTCTTCTGCTCTTTTATGAATGTTCAGAGMCC^ 

CTATGGCGATTTCCAGGMGCCTTTGATCACCTTTGTMCMGATCATTGCCACCAGGMCC 
CCTGCTTMGTACTGCMCCTCTTGGTGAGGGGCTTTAGGCCCGCCTCTC^ 
CAGGTTTTTCATCGACTTCTCAGACATTGGAGAG 
20 AGACCGCMGTATGAGTATCTCATGACCCTTCATGGAGTGGTAAATGAGAGCACAGTGTGCCTGATGGGACATGAMGM 
GACTTTAAACCTTATCACCATGCTGGCTATCCGGGTGTTAGCTGACCAAMTGTC 

TTACCAGCCAGCCCCCTATGTAGCAGATGCCAACTTTAGCMTTACTACATTGCACAGGTTCAGCCAGTATTCACGTGCCAGCA 

ACAGACCTACTCCACTTGGCTACCCTGCMTTMGMTCATTTAAAMTGTCCTGTGGGGMGCCATTTC 

AGAAAAAAAAAAAAAAAAAAAA 

25 5 

MKFSRMPKSEGGSGGGMGGGAGGAGAGAGCGSGGSSVGVRVFAVGRHQm 
NMPDLOTCKREITIMKELSGHKWIVGYLIXAVNSISDNWEVLimEYCRAGQVVNQMNKKLQTG 
LHQCKTPI IHRDLKVENILLMD^ 

GCLLYKLCFFTLPFGESQVAICIX3NFTIPDNSRYSRNIHCLIRFMLEPDPEHRPDIFQVSYFAFKFAKKDCPVSNINKCCKQLL 
30 RHGALLTEILLFLQLFLNRMTASEAAARKSQI 

NHRPKGALRPGNGPEILLGQGPPQQPPQQHRVLQQLQQGDW^ 

HATQQQQMLQQQFLMHSWQPQPSASQYPTMPQYQQAFFQQQMLAQHQPSQQQASPEYLTSPQEFSPALVSYT 
MDSSYSANRSVADKEAIANFTNQKNISNPPDMSGWNPFGEDNFSKLTEEELLDREFDLLRSSKGHLKAYFASQ 

6 

35 AGCCGGGCAGCTGCAGCGGAGCCGCGGAGCGGGCGGCGGGGCCCAGGCTGTGCGCmGGGAGCGCGGAATGTC^ 

GGCCGCAGCACGCTCGGACGGGCCAGGGGCGGCGACCCCTCGCGGACGCCCGGCTGCGCGCCGGGCCGGGGACTTGCCCTTGCA 
CGCTCCCTGCGCCCTCCAGCTCGCCGGCGGGACCATGMGMGTTCTCTCGGATGCCCAAGTCGGAGGGCGGCAGCGGCGGCGG 
AGCGGCGGGTGGCGGGGCTGGCGGGGCCGGGGCCGGGGCCGGCTGCGGCTCCGGCGGCTCGTCCGTGGGGGTCCGGGTGTTCGC 
GGTCGGCCGCCACCAGGTCACCCTGGMGAGTCGCTGGCCGAAGGTGGATTCTCCACAGTTTTCCTCGTC 

40 MTCCGATGTGCATTGAAGCGMTGTATGTCMTMC 

GCTATCTGGTCACAAAMTATTGTGGGCTATTTGGACTGTGCTGTTMTTCAATTA 
MTGGAATATTGTCGAGCTGGACAGGTAGTGMTCAMTGAATAAGAAGCTACAGACGGGTTTTACAGM 
GATATTCTGTGATACCTGTGAAGCTGTTGCMGGTTGCATCAGTGTMGACTCCMTM 
TATTTTGTTGMTGATGGTGGGMCTATGTACTT^ 

45 AGTTMTGTAGTAGAAGAAGAMTTAAAMGTATACM^ 

ACCCATCACCACCMGGCTGATATCTGGGCACTGGGATGTCTACTCTAT^ 
TCAGGTTGCTATCTGTGATGGCAACTTCACCATCCCAGACAATO 
GCTTGMCCAGATCCGGMCATAGACCTGATATATTTCMGTGTCATATTTTGCATTO 
CTCCMCATCMTMGTGTTGTAMCMTTACTGAGACACGGAGCACTGTTMCTGAAATTCTTCTA 
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gmccgatgactgctagtgmgcagctgctaggaaaagcc 

tcmttgcaccaagacaaagaccamggccmctctgctactactgccactcccagtgtgctgaccato 
cctgttamgtccttgctcctggtgmttcggtmccatagacca 
ttgggtcagggacctcctcagcagccgccacagcagcatagagtactccagcaactac 
5 ctccatttacagcatcgtcatcctcaccagcagcagcagcagcagcagcagcagcagcagcaccaccaccaccaccaccaccac 
ctacttcaagatgcttatatgcagcagtatcmcato 

gtatatcaaccacmccttctgcatcacagtatcctacmtgatgccgcagtatcagcaggcttcctw 

gctcmcatcagccgtctcmcaacaggcatcacctgmtatcttacctcccctcmga 

acttcatcacttccagctcaggttggmccatmtggactcctcctatagtgccm 
10 gcaaatttcacaaatcagmgmcatcagcaatccacctgatatgtcagggtggmtc 

ttmcagmgaggmctattggacagagmtttgaccttctmgatcmg 

taamtaacagctctattattattcagcaaggccaaagacttttgag 

aaaattcaggccatcttcttatacatatactatcamttatgttgtc 

cccaatattgctgatagtatgctmtatcctaamcttamtattgcatatctatgaatg 
15 atggaamttgatgtttcaataaamgggagacattttattattttgccttactm 

ctcmtagaamtattgtgggtctgagatgccctttgaamtgcctgaaagaamcatggg 

gtaatgtccttttmtgagtggtgacataaaagggctgctttgttgtc^ 

.ttaaaaaaaaaaaaaaaaaaaaaaaaa . 

7 

20 MQWRAL\HjGLVLLRI£LHGVL^^ 

KAPRRPGPGMCGPANWGYVLGGRGRGPDEYEKRYSGAFPPQLRAQMRDLARGMFWGYDNm 

DPSNLNINDVLGMSLTLVDALOT^ 

IKDYDNELLYMAHDLAVRLLPAFENTKTGIPYPRVNL^ 

WNLRSNDTGLLGNWNIQTGHWGK^^ 
25 \nWMFSGQLl^TWIBSLQAFFPGLQVLIGDVEDAICLHAFYYAlWKRYGALPERm\IQL^^ 

TKNPmHVGMDILQSLEKYTKWCGYATLHHVIDKSTEDRMESFFLSETCKYLYLLFDEDNPVHKSGTRYMFTTEGHIVSVDE 

HLRELPWKEFFSEEGGQDQGGKSVHRPKPHELKVINSSSNCNRVPDERRYSLPLKSIYMRQIDQMVGLI 

8 

ggtggtcggcggggaggcccccgcgctttaaaataatgcccgcggcgcccgcgcgaccatgcaatggcgagcgctcgtcctggg 
30 gctggtgctcctccggcttggcctccatggagtamtggctcgtcttcgggc 
tccgctcagcttcggcttccagcgtctgaggagccccgacggccccgcgt^ 

ggtatccgggccgtcgtggctgcagccgccggggaccggggcagcgcagagcccgcgcaaggctccgcggcgtcctgggccggg 

gatgtgcggcccagccmctggggctacgtgctgggcggccggggccgcggcccggacgagtacgagaagcgctacagcggcgc 

cttccctccgcagctccgtgcccagatccgcgacctggcacggggcatgttcgtcmgg 
35 cttcccccaggacgagctcmccccatccactgccg 

tgtactagggmctactcattgactcttgttgatgcattggatacacttgcmt 

agtcmgttagtgatcaacacagtttcatttgacamgattccaccgtc 

cctcctttctgctcacagmtmtmctgact^^ 

atacatggcccatgacctggcggtgcggctcctccctccttttgaamcacc 
40 amgacaggagttcctcctgacaccmtmtgagacatgcacagcgggagccggttc^ 

tcgactcctgggggactccacatttgagtgggtggccagacgagcagtgamgccctttgg 

amctaggcmtgtcgtgmcattcagacgggccactgggttggamgcagagtggcctc 

tgaatacctcttgamtcttacattctctttggagaaam 

ctacttmgmgagggcgggmgcctgcaatgaaggagmggagaccctccactctatgtcm 
45 gctgatgmcacctggattgactctctcx:aggcctttt^ 

ccttcatgccttctactatgccatatggamcgatatggtgccctc 

tctcttctacccactgagaccagagttagtggaatccacatatctc 

aggmtggatattctgcagagtctggaamgtacacaamgtcmgtgtgggtacgccacgctgc^ 

cacagaagaccggatggagagcttctttctcagtgagarctgtamtatttc 
50 caagtctggaaccagatacatgttcacmcagagggacacattgtatc 
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ATTCTTCTCTCMGAGGGAGGGCAGGACCAAGGGGG^ 

CAGCTCCMCTGCMTCGTGTACCTGATGAGAGG 

GGTTGGTTTGATTTGATCTGCTCTCTGTGAGGCCTCATCTTGAACCA 

gtctgamtgamggggacagmgtcttgctgtccatggtggtgtaggm1ttctc 
5 cttgcacacttcagtgtttctctcot3ttcmtaamtgccctc 

ccctcttatgac^tgttgatgttatmgcacmtagatggggcatctttggattc 

agtctcttcactmctggcacctgctatactggagtattgctatgtcttta 

gagacagggtcttgatatttttttc^ 

tacaggtgagcaccactgtacctggctagctacttctttg™ 
10 atttggtatccctatttmttgcattgaamtgtcatcctttctg 

gattttggatcacatgggagaaamgtcatccagtttto 

tmgmgmtggtcttmccagaattcttmcagatagtctc 

ttggatggtattttgcattttgmtatgaacttacctgaggmct 

mtacctmgattgtctgagcttccatctttctcatatttcctmgcmggattctca 
15 ctgttttgtttctgttttctgtgtctagtgc 

agccatttctctgtgtaatacaggctmgattagtgcc 

atggtttcaggaggcctgtttagccacatggtgagaccgtggtgamggggg 

tcctgtaamgtmgcatgtagmggaggmgttgtgctaamtgcc 
. tctctttgmctcacactgatacacacctgctactot 
20 catgggtggaaacgcattctagtaaaaaaggtaggaaatccctaaaacttccagcctcacatagcacggttctcacctgtcact 

gttttcccacctctaaggatttcatgtacatcttttcamgctagam 

amgggagamtcttattccttcttgaamttttaagtc 

attttcagtmtamcgtgccatctctatctctt^ 

attttaccttttttcatttcagmgcattattctgtt 
25 atatatatattttamtgttttcactgactcattgaaaatgttmtta 

tacctttgtctctgggcamcaggtgggactgttagtc^ 

cctgcctgtgatttctcttggcgctcccctcctctcccgctctggcttctgtggcggcagtg^ 

tcttaatgaggcactttgcctgtcactcgagcmgcctgggtgttccttcctcctcatgctcctggmta 

catgcttgcamctacacmtgctgcaggtgcttcccaggggccacaggctgtcaggamcgtgttttatgttm 
30 ccacttgacttctgggtactggmttmtaccagtgggtgagactgagggtgagtgagm 

agcttccagactaccccgtccamgtttgatgctatgtagtcagtggtttgtc^ 

TTTCAMGGTTACTTGTTTTTTTTTTTTTTTTTT 

tcttctgttctgtgttctgamtcctggg^ 

cccaggattgcctcamtctgagtggacttcatcctt^ 
35 agtgtctagtgtcagctttgctcmtgtggtggamcattttc 

agcatmttctctcatmcaaactttcaaatcattacagtagcttagctactttagttgatc 

atcataggtggcaagggagggtttgctagctctccatttgcactggccattgtg 

ttcatttttttamtttttt^ 

agatcatggactgtgcacgtgacacttamtmttttctatc 
40 ctattagaggagcgtctgtagmgtacctggtttggtcagtc^ 

cctotccccaccamggaamtatccctcttmtgatttcg 

ggattgtgtagactcmcatgagacattcctttctgctttc 

ctgttgacaaaaacaaamtcttttttcamtgtagtgctggtgaam 

tgagcaggmctttagaagaaaatcctgagctttcctc^ 
45 ggcagggaccccttgggamtcgaggaggtgggacgggctgggccctgtc 

gcttgmtctggacgtgmtctggtaaamtatc^ 

ggcagaggmtaatatttttamggttattttgtttta 

agamggtamtctttttacaaaaaamgtatagagttggamctctgggaamctta 

atgtgcmtatgctttgcaactgtagatgatattttatgtt-tmtctgtamtm 
50 ttgtaamggaccamtgttcttttatamtgtmtmggmtatc 

tttattaaaagatttctttttttg 
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9 

MCEIMPLQSSQEDERPLSPFYLSAHVPQVSNVSATGELLERTIRSAVEQHLFDVNNSGGQSSEDSESGTLSASSATSARQRRR 
QSKEQDEVRHGRDKGLIMENTPSGFNHLDDCIL^ 

RSHERTGPDDFEWMSDERKGNEKIX5GHTQHFESPTMKIQEHPSLSDTKQQRNQDAGDQEESFVSEVPQSDLTALCD 
5 PAFSSWQRENSDSDEAHLSPQAGRLIRQLLDEDSDPMLSPRFYAYGQSRQYLDDTEVPPSPPNSHSFMRRRSSSLGSYDDEQED 
LTPAQLTRRIQSLKKKIRKFEDRFEEEKKYRPSHSDKAMPEVLKWTNDLAKFRRQLKESKLKISEEDLTPRtiRQRSNTLPKSF 
GSQLEKEDEKKQELVDKAIKPSVEATLESIQRKLQEKRAES 

MPLYDRYRLVKQILSRMTIPIIGSPSSKRRSPLLQPIIEGETASFFKEIKEEEEGSEDDSNVKPDFMVTLKTDFSARCFLDQ 
FEDDADGFISPMDDKIPSKCSQDTGLSNLHAASIPELLEHLQEMREEKKRIRKKLRDFEDNFFRQNGRNVQKEDRTPMAEEYSE 
10 YKHIKAKLRLLEVLI SKRDTDSKSM 

10 

AMTGTAGAGMGCAGCCGATAAMTAGCATTGCCTGMGMGTTTGGAGGCTGAGAGCAGCAGTAGACTGGCC 

CMGTTGTTTCTCCAGCCGTGCGGTGCAGCCTCATGCCCCCMCCCAGCTTAGCCACTGTM 

ACCAMCTTGCCGTGGMGAGACAGTTGTGAGATTCCCTOCAMTTTACATACGAGMTGGCTTC 

15 MGTTCACAGGAAGATGAAAGACCTCTGTCACCTTTCTATTTGA 

AGMCTCTTAGAAAGMCCATCCGATCAGCTGTAGMCMCATCTTTTTGATGTTMTAACTCTGGAGGTC 
CTCAGMTCTGGMCACTATCAGCATCTTCTGCCACATCTGCCAGACAGCGCCGCCGCCAGTCCMGGA 
ACATGGGAGAGACMGGGACmTCMCAMGAAMTACTCCTTCTGGGTTCMCCACCTTGATGATO 
GGMGTCGAAMGGTACACAAAAATACTTTTGGTTGTGCTGGAGAMGGAGCMCKZC 

20 TTCTGAGCTTCATGACMTCAGGACGGTCTTGTGAATATGGAMGTCTCAATTC 
TGATTTTGMTGGATGTCTGATGAMGGAMGGAMTGAAAM^ 

GATCCAGGAGCATCCCAGCCTATCTGACACCAMCAGCAGAGAMTCMGATGCCGGTGACCAGGAGGAGAGCTTTC 
AGTGCCCCAGTCGGACCTGACTGCATTGTGTGATGAAAAGMCTC^ 

GAACAGTGACTCTGATGAAGCCCACCTCTCGCCGCAGGCTGGGCGCCTGATCCGTCAGCTGCTGGACGAAGACAGCGACCCCAT 
25 GCTCTCTCCTCGGTTCTACGCTTATGGGCAGAGCAGGCAATACCTGGATGACACAGAAGTGCCTCCTTCCCCACCAAACTCCCA 

TTCTTTCATGAGGCGGCGAAGCTCCTCTCTGGGGTCCTATGATGATGAGCMGAGGACCTGACACCTGCCCAGCTC 

GATTCAGAGCCTTAAAAAGMGATCCGGMGTTTGMGATAGATTCGAAG 

AGCAGCCAATCCGGAGGTTCTGAMTGGACAMTGACCTTGCCAAATTCCGGAGACMCTTAM 

TGMGAGGACCTMCTCCCAGGATGCGGCAGCGMGCMCACACTCCCCMGAGTTTTC 
30 GMGMGCMGAGCTGGTGGATAMGCAATAAAGCCCAGTGTTGMGCCACATTGGMTCTATTCAGAGGMGCTCCAGGAGM 

GCGAGCGGAMGCAGCCGCCCTGAGGACATTMGGATATGACCAMGACCAGATTGCTMTGAGAMGTGGCTCTGCAGAM 

TCTGTTATATTATGAAAGCATTCATGGACGGCCGGTAACAAAGAACGAACGGCAGGTGATGAAGCCACTATACGACAGGTACCG 

GCTGGTCAMCAGATCCTCTCCCGAGCTMCAC^^ 

GCCMTTATCGAGGGCGAMCTGCTTCCTTCTTCMGGAGATAMGGAAGMGAGGAGGGGTCA 
35 GCCAGACTTCATGGTCACTCTGAAMCCGATTTCAGTGCACGATGCTTTCTGGAC 

TTCCCCMTGGATGATAAMTACCATCAAMTGCAGCCAGGACACAGGGCTTTCAMTCTCCATGCTGCCTC 
CCTGGMCACCTCCAGGAAATGAGAGMGAAAAGAAMGGATTCGAMGAAACTTCGGGATm 

GMTGGMGAMTGTCCAGMGGMGACCGCACTCCTATGGCTGMGMTACAGTGAATATMGCACATAAAGGCGAAACTGAG 
GCTCCTGGAGGTGCTCATCAGCMGAGAGACACTGATTCCMGTCCATGTGAGGG^ 
40 GCGGTGAGAGTTTACTGTCCCCAGAGAMGTGCAGCTCTGGMGGCAGCCTTC 
GCCTCTAGACCATTTC&CATCGGCTCCTGTTTCCATTGCCTC^ 

GCATTTTGTMWGGAMGTCMGACTGCAGTATGTGCACATGCGCACGCGCATGCACGCACACACACACACAGTAGTGGAG 

TTCCTMCACTAGCAGAGATTMTCACTACATTAGACMCACTCATCTACAGA 

GAAACMGAGACCATTCTCTGTCTMCTGTGATAAAAACMGCTC 
45 ATGCTCTCCTTGGACCCAGTTMCTGCAMCGTG^ 

GCTGCGCCTTCCTCACGTGTGTGAMGGTTTTCCCCTTCAGCCCTCAGGTAGATGGM 

AGTCATCATCTTCAGGATGTTTCTTCAGGACTTCCTC^ 

AGGACAGAGAGATGGTMGCAGCTGTATGMTGCTGATTTTAAMCCA 

CACTGACTGCACTTACCAGTCTGATTTTATCGTC 
50 TGTTGTGGCACTAGCCAMCATAMGGGGCTTAAGTCAGCCTGCATACAGAGGATCGGGGAGAGMGGGGCCTC 

CTCCTGAGTACTTACCAGAGTTTMTTTTTTTAAAAAAMTCTGC^ 
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AGCTCAGCCCMGGCAGMTCTAAATCACACTATTTTCGAGATCATGTATAAAMGA 
ATTATMTTTTTTTCAAAGACTTTC 

CTCCATCTCTTCAGTGTGCCTGATGTCACCTCATGATTTGCTGTTACTTTTTTMCTCCTC 
TCCACCTTTGTGCTTTGCGAGGCCGAGCCCAGGCATCTGCTCGCCTGCCACGGCTGACCAG 
5 CTTAGACGACGTGTTACAGTATGMCACACAGCAGAGC^ACCCTCGT^ 
MGGAAAAGAAAMGAATGTAAMCTATACTGACCCGTTTTCAGm 
TACAGCMCATTTTCCATTGCTGMGTGAGGTAGCAGCTCTCTTCTGTCAGCTGMTGT^ 
TAAGTTTGCTCTTMTCGTATGGMGCTTGAGCTATGTGTTGGMGTGCCCTGGTT 
CTACAGGTTTAMTGTACATAAAMTATAGTTTGGMTTCTTTGCTCTACrc 
1 0 TGAGATTATAMTAAAATGATGCACTTTAGGATC 

TGTTTTTTTAAAMTACATGTCTAGTCTGTCCTTT^ 
GCTTTTAMGCACATTTGTTTMGACTATGTTT^ 
CTACTMTTGTTTTGGAATCTGTTGTTTCTGCCAMGGTAAATT 
GATTCAATAAAAGAAAACACCAAGTAAGTTATATAAAAT 

15 11 

MLKGVQRIOIADKYWEYTFKVNWSDIjSVTTVTKTHQELQEFLLKLPKELSSETFDKTILRALN 

SSSSQAFLQSQKVHSFFQSISSDSLHSimQSSLKTSKILEHLKEDSSEASSQEEDVLQHAIIHKKHTGKSPIVNNIGTSCSP 
LDGLTMQYSEQNGIVDWRKQSCmQHPEH^ 
HGGTWDVNLDIGSGHDTCGETSSESYSSPSSPRHIX5RESF 
20 ASLGNENGNLLEDPLNSPKYQHISFMPTLHCW1HNGAQRSEVWPAPKPADGKTIGMLVPSPVAISAIRESANSTPVGILGPTA 
CTGESEKHLELLASPLPIPSTFLPHSSTPALHLTVQRLKLPPPQGSSESCTVNIPQQPPGSLSIASPNTAFIPIHNPGSFPGSP 
VATTDPITKSASQWGLNQMVPQIEGNTGWPQP 

ASAGISQAQATVPPAVPTHTPGPAPSPSPALTHSTAQSDSTSYISAVGNTNANGTWPPQQMGSGPCGSCGRRCSCGTNGNLQL 
NSYYYPNPMPGPMYRVPSFFTLPSICNGSYIjNQAHQSNGNQLPFFLPQTPYANGLVHDPVMGSQAl^GMQQMGFGRFYP 
25 PNWANTSGSGPKKNGIWSCYNCGVSGHYAQrX:KQSSMEANQQGTYRLRYAPPLPPSNDTLDSAD 

12 

ATCGGAAMGCACCTTGAGTTACTGGCTTCCCCTTTAC 

TCTTACAGTTCAGAGGCTAMGTTGCCACCACCACAGGGATCTTCTGAGAGCTGCACAGTTAACATCCCACAACAACCACCCGG 
MGCCTGAGCATCGCATCACCAMCACTGCCTTTATTO 
30 GGACCCCATCACAAAATCTGCATCCCAAGTGGTAGGACTCAATCAAATGGTGCCTCAAATTGAGGGAAACACAGGGACAGTCCC 
TCAGCCTACCMTGTGMGGTAGTTCTTCCAGCAGCTGGCCTCTCAGCTGCTCAGCCACCAGCTTCCTACCCCTTACCAGGCTC 
TCCCCTTGCTGCCGGCGTGTTACCCAGCCAGAACTCCAGTGTGCTCAGCACAGCAGC 
TATCAGCCAGGCCCAGGC^CTGTTCCTCCTGCAGTTCC^ 

ACACAGTACCGCGCAGAGTGACAGCACCTCTTACATCAGTGCTGTGGGGAACACGAACGCTAATGGGACAGTAGTGCCACCGCA 
35 GCAGATGGGCTCAGGTCCTTGTGGOTCTTC 

TTATCCTMTCCMTGCCTGGACCMTGTACCGAGTCCCTTCATTCTTTACTCTG 
CCMGCACATCAGAGCAATGGAMCCMCTTCCTTTTTTTCTGCCTC 

CATGGGGAGCCMGCCMCTATGGCATGCAGCAGATGGCAGGATTTGGGAGATTCTATCCTGTATATCCAGC 

TGCCMCACCAGTGGTTCGGGGCCCMGAAGAATGGGMTGT 
40 CTGTMGCAGTCGTCCATGGAGGCCAATCMCMGGCACmCAGACTGAGATACGCACCTCCCCTCCCCC 

GTTGGATTCTGCAGACTGAMCGAGTAAAGCTTGCCTACTTMTACACTCMGTG 

MGGAMGGTATTTTGTTTCTrrcTCTATACATTTCCTAGA 

CAAMCMGAMGAATGCMTGCTTTTGAGCCTCTGGTCTCCTGGTTC 

ACCTTCAGACAMCTTAMTGTTGGTGCGTGCTTTO 
45 TTAAMCTGTGTATTTGACCTTTTTTTTACM 

CAGCTTTTCTCACTTTCACCCTAMCGACACTTCCTCCCCAGCCAGCCTCACT^ 

AGTGCATTCACCCCACTTTTGTAMCTGCTCT 

ACTCCCAAMTAGTGTGTATATATGTAATAMCAGCGTCACGTAMTACATATATGCAGTGCTTGTTC 
MTMGTGGMGAGAGAGGMGMGTCMCCATATGAMCTGAAAAMTATGACGTACGAM 
50 ACCMCTTTTTACTTCCATCATCCTTTTTTAGC^ 
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CTGTGTGCTTGTGTTTGTM 

TGMGATTTCTTCTTTCCCTGTACCAGCTGTTACAGTGTTAC 

gctatgggtttctaccatmgtcaggttgtttgttccctmcctgtctctcg 

atgcttgattatmtgtgamggcggmttctgagtgtgttmgatggtattm 
5 gctgctgtttttaaaaaaaaaaamgtttttt^ 

mgtgcactgaggttatctggmgattgggtgtattttttc 

gtcctaggggtcagggggtgggggtttcattttccattc 

gttggtatgcagcagaggmcataaacatttggtcttggttc 

aaamgmgcttmtttcatgcttcataagtagcattcatatttatagcaccaatc 
1 0 ggagttatggggmggggtgggtgtgaaggggtagatgamgctto 

attamtatattttamgatctgggtatttttggaccacattattam 

gagttttgataagccacttatgggccgcgttgtgmtcacttgc^ 

tgtactgtacataggaggtatgttaccttctccttatttgtatc 

mggccacttatatttctagmcagatogattttatgcmcctttt™ 
15 aaaaaaaaaaaaaaaaaaa 

13 

RIFGASPPRNI^IQKCASRTAAAMGSEDHGAQNPSCKI^R^ 

DIDDWI PAPIQQWTGQSGLFTQYNIQKK^ 

VAQWNIGSLRTILDMVERECGTIIEGVNTPYLYFGIWTK^ 
20 GSSQGCDAFLRHKMTLI SPI I LKKYGI PFSRITQEAGEFMITFPYGYHAGFNHGFNCAESTNFATLRWIDYGKVATQCTCRKDM 

WISMDVFVRILQPERYELWKQGKDLWLDHTRPTALTSPELSSWSASRASLKAXLLRRSHRKRSQPKKPKPEDPKFPGEGTAG 

MLLEEAGGSVKEEAGPEVDPEEEEEEPQPLPHGREAEGAEEDGRGKLRPTKMSERKKKSFGLLPPQLPPPPAHFPSEEALV^ 

PSPLEPPVLGPGPAMEESPLPAPLWVPPEVPSEELEMPRPIIPMLYWPRPGKAAFNQEHVSCQQAFEHFAQKGPTWKEP 

SPMELTGPEDGAASSGAGRMETKARAGEGQAPSTFSKLKMEIKXSRI^ 
25 DALRPLLSLQV^RMSFQAERKFNAAMRTEPYCAICTLFYPYCQALQTEKEAPIASLGEGCPATLPSKSRQKTRPLIPEMCF 

TSGGENTEPLPANSYIGDDGTSPLIACGKCCLQVHASCYGIRPELVNEGV^ 

VICAIAVPEARFLWIERHPVDISAIPEQRWKLKCVYCRKRMKKVSGACIQCSYEHCSTSFHVTCAHMGVLMEPDDWPYW 
TCLKHKSGGHAVQLLRAVSLGQWITKNRNGLYYRCRVI^^ 
RWTDGNLYKAKFISSVTSHIYQVEFEDGSQLTO 
30 EDSGRSQDYVAFVESLLQVQGRPGAPF 

14 

GTCGCCAGCAACCGAGCGGGGCACGCCCGAGCGGGGCCTGGGGGTGCGAGCCGAGGGCGGGGGAGAGCGCGCCGCTGCTCCCGG 

ACCGGGCCGCGCACGCCGCCTCAGGMCCATCACTGTTGCTGGAGGCACCTGACAMTCCTAGCGMT1TITC 

CCCAGGMCCTCGCCATCCAGAAGTGTGCTTCCCGCACAGCTGCAGCCATGGGGTCTGAGGACCACGGCGCCCAGAACCCCAGC- 

35 TGTAAMTCATGACGTTTCGCCCAACCATGGMGMTTTAMGACTTCMCAMTACGTGGCCTACAT^ 

CACCGGGCGGGCCTGGCCMGATCATCCCCCCGAAGGAGTGGMGCCGCGGCAGACGTATGATGACATCGACGACGTGGTGATC 
CCGGCGCCCATCCAGCAGGTGGTGACGGGCCAGTCGGGCCTCTTCACGCA 
GAGTACCGCCGCCTGGCCMCAGCGAGAAGTACTGTACCCCGCGGCACCAGGACT^ 
MCCTGACCTTTGTCTCCCCGATCTACGGGGCTGACATCAGCGGCTCTTTC 

40 AGCCTCCGGACCATCCTGGACATGGTGGAGCGCGAGTGCGGCACCATCATCGAGGGCGTGAACACGCCCTACCTC 
ATGTGGMGACCACCTTCGCCTGGCACACCGAGGACATGGACCTGTACAGCATC^ 
TGGTACGCCATCCCACCAGAGCACGGCMGCGCCTGGAGCGGCTGGCC^TCGGCTTCTTCCCCGG 
GCCTTCCTGCGGCATAAGATGACCCTCATCTCGCCCATCATCCTGMGMGTACGGGATCCCCTTC 
GCCGGGGMTTCATGATCACATTTCCCTACGGCTACCACGCCGGCTO 

45 GCCACCCTGCGGTGGATTGACTACGGCAMGTGGCCACTCAGT 

TTCGTGCGCATCCTGCAGCCCGAGCGCTACGAGCTGTGGAAGCAGQ 

GCGCTCACCAGCCCCGAGCTGAGCTCCTGGAGTGCATCCCX^GCCTCGCTGAAGGCCMGCTCOTCCGCAGGTCTCACCGGAM 
CGGAGCCAGCCCMGAAGCCGAAGCCCGMGACCCCMGTTCCCTGGGGAGGGTACGGCTGGGGCAGCGCTCC 
GGGGGCAGCGTGMGGAGGAGGCTGGGCCGGAGGTOACCCCGAC^^ 
50 GAGGCCGA(^GCGCAGMGAGGACG(^AGGGGCMGCTGC(^CCMCCMGGCCMGAGCGAGCGGMGMGMGAGCTTCGGC 
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ctgctgcccccacagctgccgcccccgcctgctcacttcccctcagaggaggcgctgtc^ctgcc 
gtgctgggcccaggcccixkagccatggaggagagccccctgccgg 

gagctagaggccmgcctcggcccatcatccccatgctgtacgtggtgccgcggccgggcmggcagccttcaaccaggagcac 
gtgtcctgccagcaggcctttgagcactttgcccagmgggtccgacctggmggaaccagw 

5 CCAGAGGACGGTGCAGCCAGCAGTGGGGCAGGTCGCATGGAGACCAMGCCCGGGCCGGAGAGGGGCAGGCACCGTCCACATTT 

tccamttgmgatggagatcmgaagagccggcgccatcccctgggccggccgcccacccggtccccactgtcggtggtc 

caggaggcctcmgtgacgaggaggcatcccctttctccggggaggaagatgtc 

tctctgcagtggaagmcagggcggccagcttccaggccgagaggmgttcmcgcagcggc^ 

gccatcitcacgctcttctacccctactgccaggccctacagactc^^ 
10 ccggccacattaccctccaamgccgtcagmgacccgaccgctcatccctgagatgtc 

gagccgctgcctgccmctcctacatcggcgacgacgggaccagccccctgatcgcctg^ 

gccagttggtatggcatccgtcccgagctggtcmtgmggctggacgtgttcccggtc 

tgctgcctgtgcmcctgcgaggaggtgcgctgcagatgaccaccgataggaggtggatccacgtg^ 

cccgaggcgcgcttcctgmcgtgattgagcgccaccctgtggacatcagcgccatccccgagcagcggtggm 
15 gtgtactgccggmgcggatgmgmggtgtcaggtgcctc^ 

acctgcgcccacgccgcaggcgtgctcatggagccggacgactggccctatgtggtctccatcacctgcctcmgcacm 

gggggtcacgctgtccaactcctgagggccgtgtccctaggccaggtggtcatc^^ 

tgtcgcgtcatcggtgccgcctcgcagacctgctacgmgtgmcttcgacgatggctc^ 

agcatcacgagtagggactgtgtccagctgggacccccttccgaggggg 
20 tacmggccmgttcatctcctccgtcaccagccacatctaccaggtc^^ 

gc&gacatcttcaccctggaggaggagctgcccmgagggtccgctctcggctc^ 

gccttctcgggggaggaggccmggccgccaagcgcccgcgtgtgggcaccccgc 

gactacgtggccttcgtggagagcctcctgcaggtgcagggccggcccggagcccccttctaggacagctc 

CCCTCAGCCCCKjCGGCXjAGGCCATGGCATGCCCCGGGCGTTC 
25 GGCCACCTCCAAGCCGCGGGTGCCCCCTAGGGCGACAGGAGCCAGCGGGACGCCGCACGCGGCCCCAGACTCAGGGAGCAGGGC 
CAGGCGGGCTCC&GGGCCGGCCAGGGGAGCACCCCACTCMCTACra 
AGGTGCTACTGCMTGCCCTACTGAGCMCCTTOAGATTGTCACTTCTGTACATAMCC 
ATACATATTGTTTAAAAAAMGCMGAAAAAAAGGA 

AGAAAMCAAMCAAMCAAACACATTGTTTTTCTCAGMCCAGGATTCTCTGAGA 
30 GTTGTTTTAAMTATTATGATTTGGCTACAGACCAGGCAGGGAMGAGACCCGGTMTT^ 

aggacgccccggtttcggcacagcccggtcactcacggcctcgctctc^ 
ccagtgcctgtgcccactctgtgcctgctgggaggaggcccaggctctctggtggccgcccctgtgcacctg^ 
ccgggggtctggggcctccctccgtctgcgcccacctttgcagmtamctctctcctggggtttgtc 
acctgagagamccx:aggtgttccagaggcttccttgcagacaaagcacccctc^acctcctatggctca^ 

35 ccca(&cccttctgg™gtagtgagtgtggac^ 

cgaggcaggccttcggggcccctttcgcctgcttccgggcagggacgaggcctggtgtcctcgctccacccacccack 
tcacctgaggggmtctgcttcttaggagtgggttgagctgatagagaaaam 
tccaggtgcctctccctcaccagctctgcacccctctggggagccttccccaccttagctgtctcct 
aggagatmtttgcttatattaaamcaaaamtggctgaggcaggagtttgggaccagcct^ 

40 cactacamttttttacamttagctaggtgtggtggtgcgcacctgtg 
gcttgagtccaggaggttgaggctgcagtcagctcagattgcaccactg^ 
ccaaaaaaaaaaaamgcmtgtttatattataaaagagtgtcctmcagtccccgggc^ 
gagtgttacgcaggagcmgcctttcatttcctt^^ 

ggggggtgtcagccaamcgtggaggtgtccctctgcacgcagccctcgcccggcgtggcgctgaca 
45 ttgaamtgctatttatattgtamgmgcgggcgggtgcccctgctgcccttgtcccto 

ggctcctgggcggcctgcgcagatgggccacagaagggcaggccggagctgcacactctccccacgmggta^ 

actctgtgcamgacgcggcaaaacccagtgccctggtttt^ 

tccctgcctctaggttcataatgaattaaaggttcatgaacgctgcg 

15 

50 MTTTVATDYDNIEIQQQYSDVNNRTOTO^ 

GRMLIKLLEVLSGERLPKPTKGRMRIHCLENVDKALQFLK^ 
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NKEKKSAKDALLLWCQMKTAGYPNVNIHNFCT 

PEDI SVDHPDEKS I ITYWTYYHYFSKMKALAVEGKRIGKVLDNAIETEKMI EKYESLASDLLEWI EQTI I ILNNRKFANSLVG 
VQQQLQAFNTYRTVEKPPKFTEKGNLEVLLFTIQSKM^^ 
EQLARRFDRKAAMRETWLSENQRLVSQDNFGFDLPAVE^^ 
5 NVIRLWEYLLELLRARRQRLEMLGLQKIFQEMLYIMM 

ASAQKFATIX5EGYKPCDPQVIRDRVMMEFCYQELCQLMERRARLEESRRLWKFFWMAEEEGWIREKEKILSSDDYGKDLTS 
VMRLLSKHRAFEDEMSGRSGHFEQAIKEGEDMIAEEHFGSEKIRERIIYIREQWANLEQLSAIRKKRLEEASLLHQFQADADDI 
DAWMLDILKIVSSSDVGHDEYSTQSLVKKHKDVAEEIANYRPTLDTLHEQASALPQEHAESPDVRGRLSGIEERYKEVA 
RKQALQDTLALYKMFSEADACELWIDEKEQVifljN^ 
10 EIKAQQDKLNTRWSQFRELVDRKKDALLSALSIQOT^ 

EMLSDLQKEAEKLESEHPDQAQAILSRLAEISDWEEMKTTLKNREASLGEASKLQQFLRDLDDFQSWL 
TLTEAEKLLTQHENIKNEIDOTEEDYQK^^ 

OTKQAEAFLNNQEWLAHTEMPTTLEGAEAAIKKQEDFMTTMDANEEKINAW 

RETASELLMRLKDNRDLQKFLQDCQELSLWINBK^ 
15 PETEAWKEKLTGLHKMWEVLESTTQTKAQI^^ 

MEVRKKEIEELQSQAQALSQEGKSTDEVDSKRLWQTKFM^ 

HGHNLQTVQLLIKKNQTLQKEIQGHQPRIDDIFERSQ 
' . I YYFBAAEAEAWMSEQELYMSEEKMDEQSAVSMLKKHQILEQAVEDYAETVHQLSKTSRALVADSHPESERISMRQSKW 

AGLKDLAEERRGKLDERHRLFQLNREV^ 
20 LINSGHSDAATIAEWKIX5LNEAWADLLELIDTRT^ 

FEHDIQALGTQTOQLQEDAARLQMYAGDKADDI QKRENEVLEAWKSLLDACESRRVRLVDTGDKF R 

QIEAQEKPRDVSSVELLMMHQGIKAEIDARNDSFTTCIELGKSLLARKHYASEEIKEKIjIjQLTEKRKEMIDKWE 

LEVHQFSRDASVAEAWLLGQEPYLSSREIGQSVDEVEKLIKRHEAFEKSM^ 

PSPEPSTKVSEEAESQQQWDTSKGEQVSQNGLPAEQGSPRMETV^^ 
25 MTLPARTQETPS AQMEGFLNRKHEVffiAHNKXAS SRSfflNVYCVINNQEMGF YKDAKTAASG I PYH SEVPVSLKEAVCEVALDY 

KKKKHVFKLRLNDGNEYLFQAKDDEEMNTWIQAISSAISSDKHEVSASTQSTPASSRAQTLPTSVVTITSESSPGKREKDKEKD 

KEKRFSLFGKKK 

16 

gtgagctgmgcagggcagggcatcmctcacccaggmgtgcmggggtttggggattttccm 
30 tgacagactgtacctggaaaaacaggacactcttgcccamtactgcactttttgca 

attctctcctgtgcctgattcattgggtcccacacccatagggccttgcttactc^cagtgcag 

atccccgggagaactctmgaaggagctgatgtggaggagcagctgagacagttcaagatgacgaccacagtagccacagacta 

tgacaacattgagatccagcagcagtacagtgatgtcaacaaccgctgggatgtcgacgactgggacmtgagaacagctctgc 

gcggctttttgagcggtcccgcatcmggctctggcagatgagcgtgaagccgtgcag^ 
35 ttcccaccttgcccgtgtgtcctgccggatcacagacctgta 

ggtcctctctggagagaggctgcctamcccaccmgggacgmtgcgcatccactgcttagagmtgtc 

gttcctgmggagcagagagtccatcttgagmcat^^ 

catctggaccatcatcctgcgcttccagatccag 

tgcattgctgttgtggtgccagatgmgacagctgggtaccccmtgtcmcattcacm 
40 catc&ccttcmtgcactgatacaca^ 

CCTGCAGMTGCATTTAATCTGGCAGMCAGCACCTCGGCCTCACTAMCTGTTGGACCCCGMGACATCAGCGTGGACCATCC 
TGATGAGMGTCCATMTCACTTATGTGGTGACTTATTACCACTACTTCTCTMGATC 
MTTGGAAAGGTGCTTGACMTGCTATTGAMCAGAAAAMTGATTGAAMGTATGMTCAC 
GATOMCAMCCATCATCATTCTGMCMTCGCAMTTTGCCMTTC 
45 CACmCCGCACTGTGGAGAMCCACCCAMTTTAC^^ 

GAGGGCCMCMCCAGMGGTCTACATGCCCCGGGAGGGGMGCTCATCTCTGACATCMCAAGGCCTGGGAMG 
AGCGGMCACGAMGAGMCTGGCTTTGCGGMTGAGCTCA^ 

C^GGCAGCTATGAGGGAGACTTGGCTGAGCGAAAACCAGCGTCTGGTGTCTCAGGACM 
TGAGGCCGCCACAAAAMGCACGAGGCCATTGAGACAGACATTGCCGCAT^ 
50 CAGGGAGCTCGAGGCCGAGMTTACCACGACATCMGCGCATCA 

ACTGGMCTGCTCAGGGCCCGGAGACAGCGGCTCGAGATGMCCTGGGGCTGCAGAAGATAT^ 
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CKSACTGGATGGATGAAATGAAGGTGCTAGTATTGTCTCM 

GCACACCCTGGTTGMGCAGACATTGGCATCCAGGCAGAGCGGGTGAGAGGTGTCMTGCCTCCGCCCAGAA 
CGGGGMGGTTACAAGCCCTGTGACCCCCAGGTGATCCGAGACCGCGTGGCCCACATGGAGTTC 
GCTGGCGGCTGAGCGCAGGGCCCGTCTGGMGAGTCCCGCCGCCTCTGGMGTTCTTCTGGGAGATGGCAG 
5 GATACGGGAGMGGAGMGATCCTGTCCTCGGACGATTACGGGAAAGACCTGACCAGCGTCATGCGCCTGCTCAGCAAGCACCG 
GGCGTTCGAGGACGAGATGAGCGGCCGCAGTGGCCACTTTGAGCAGGCCAT^ 

CTTCGGGTCGGAGMGATCCGTGAGAGGATCATTTACATCCGGGAGCAGTGGGCCMCCTAGAGCAGCTCTCG^ 

GAAGCGCCTGGAGGAGGCCTCCCTGCTGCACCAGTTCCAGGCAGATGCTGATGACATO 

GATTGTCTCCAGCAGCGACGTGGGCCACGATGAGTATTCCACAeAGTCTC 

10 CGCCAATTACAGGCCCACCCTTGACACGCTGCACGAACAAGCCAGCGCCCTCCCCCAGGAGCATGCCGAGTCTCCAGACGTGAG 
GGGCAGGCTGTCGGGCATCGAGGAGCGGTATMGGAGGTGGCAGAGCTGACGCGGCTGCGGMGCAGGCACTCCAGGACA 
GGCCCTGTACMGATGTTCAGCGAGGCTGATGCCTGTGAGCTCTGGATCGACGAGMGGAGCAGTGGCTCMCM 
CCCAGAGMGCTGGAGGATCTGGAGGTCATCCAGCACAGATTTGAGAGCCTAGMCCAGAMTGAACMCCAGGCTTC 
TGCAGTGGTGMCCAGATTGCACGCCAGCTGATGCACAGCGGCCACCCMGTGAGMGGAMTCAAAGCCCAGCAGGACAAACT 

15 CAACACMGGTGGAGCCAGTTCAGAGMCTGGTTGACAGGAAGMGGATGCCCTCCTGTC 
CCTCGAGTGCMTGAMCCAMTCCTGGATTCGGGAAMGACCMGGTCATCGAGTC 
TGGCGTCATGGCCCTGCAGCCXTMGCTGACCGGCATGGAGCGGGACTTGGTGGCCATTGAGGCA^ 

GGAGGCGGAGAAGCTGGAGTCCGAGCACCCCGACCAGGCCCAGGCCATCCTGTCTCGGCTGGCCGAGATCAGCGACGTGTGGGA 
GGAGATGAAGACCACCCTGAAAMCCGAGAGGCCTCCCTGGGAGAGGCCAGCMGCTG^ 

20 CTTCCAGTCCTGGCTCTCTAGGACCCAGACAGCGATCGCCTCGGAGGACATGCCAAACACCCTGACCGAGGCTGAGAAGCTGCT 
CACGCAGCACGAGAACATCAAGAATGAGATCGACAACTACGAGGAGGACTACCAGAAGATGAGGGACATGGGCGAGATGGTCAC 
CCAGGGGCAGACCGATGCCCAGTACATGTTTCTGCGGCAGCGGCTGCAGGCCCTGGACACTGGATGGMCGAGCTCCACMGA^ 
GTGGGAGMCAGACAAMTCTCCTATCCCAGTCACATGCCTACCAGCAG 
TMCMCCAGGAGTATGTTCTGGCTCACACTGAMTGCCTACCACCTTGGMG^ 

25 CTTCATGACCACCATGGACX5CCMTGAGGAGMGATCMTGCTGTGGTGGAGACTGGCCGGAGGCTGGTC 

CMCTCAGATCGCATCCAGGAGMGGTGGACTCTATTGATGACAGACATAGGMGMTCGTGAGACAGCCAGTGM^ 
GAGGTTGAAGGACAACAGGGATCTACAGAAATTCCTGCAAGATTGTCAAGAGCTGTCTCTCTGGATCAATGAGAAGATGCTCAC 
AGCCCAGGACATGTCTTACGATGMGCCAGAMTCTGCACAGTAMTGGTTGMGCATCAAGCATTTATGGCAGAACTTGCATC 
CMCAMGMTGGCTTGACAAMTCGAGAAGGAAGGMTGCAGCTCATTTCAGAAAAGCCTGA 

30 GAMCTCACTGGTTTACATAAMTGTGGGMGTCCTTGMTCCACTACCCAGACAMGGCCCAGCGGCTC 
GGCCGMCTTTTCACCCAGAGCTGTGCAGATCTAGACAMTGGCTGCACGGCCTGGAG 
CAMCACCTGACCAGTGTCAATATCCTGCTGA^ 

AGAGCTCCAAAGCCAAGCCCAGGCCCTGAGTCAGGAAGGGAAGAGCACCGACGAGGTAGACAGCAAGCGCCTCACCGTGCAGAC 
CMGTTCATGGAGTTGGTGGAGCCCTTGAACGAGAGGAAGCATAACCTGCTGGCCTCCAAAGAGATCCATCAGTTCAACAGGGA 

35 TGTGGAGGACGAGATCTTGTGGGTTGGAGAGAGGATGCCTTTGGCMCTTCCACGGATC 

GCTGTTMTAMGAAAMTCAGACCCTCCAGAMGAAATCCAGGCX3CACCAGCCTCGCATTGACGACATC 
AMCATCGTCACTGACAGCAGCAGCCTCAGCGCTGAGGCCATCAGACAGAGGCTTGCCGACCTGMGCAGCTC 
CATTGAGGAGACAGAGAAACGCCACAGGCGGCTGGAGGAGGCGCACAGGGCCCAGCAGTACTACTTTGACGCTGCTC 
AGCCTGGATGAGCGAGCAGGAGCTGTACATGATGTCAGAGGAGAAGGCCMGGATGAGCAGAGTGCTGTC 

40 GCACCAGATCTTAGAACAAGCTGTGGAGGACTATGCAGAGACCGTGCATCAGCTCTCC 
CAGCCATCCTGAMGTGAGCGCATTAGCATGCGGCAGTO^ 

GAGMGAGGCAAGCTGGATGAGAGACACAGGTTATTCCAGCTCAACCGGGAGGTGGACGAC^ 
GGAGGTGGTCGCAGGGTCCCATGMCTGGGACAGGACTATGAGCATGTCACGATGTTACMGMCGATTCCGGGAG 
AGACACCGGGMCATTGGGCAGGAGCGCGTGGACACGGTCMTCACCTGGCAGATGAGC 
45 CGCCACCATCGCTGMTGGMGGATGGCCTCMTGMGCCT^ 
TGCCGCTTCCTATGAACTGCACMGTTTTACCACGATC^ 

TGAGGAGCTTGGGAGAGATCAGMCACAGTGGAGACCTTACAGAGMTGCACACTACATTO 
CACACAGGTGAGGCAGCTCCAGGAGGATGCAGCCCGCCTCCAGGCGGCCTATGCGGGTGACMGGCCGA 
CGAGMCGAGGTCCTGGMGCCTGGMGTCCCTCCTGGACGCCTGTGAGAGCCGCAGGGTGCGGCTGGT^ 
50 GTTCCGCTTCTTCAGCATGGTGCGCGACCTCATGCTCTGGATGGAGGATC 
GGATGTATCATCTGTTGMCTCTTMTGAATMTCATCMGGCATCAM 
CTGCATTGMCTTGGGAAATCCCTGTTGGCGAGAA 
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AMGAGGAMGAMTC^TCGACAAG^ 
CGCCAGTGTGGCCGAGGCCTGGCTGKTTTGGACAGGAGCCGT^ 

GAAGCTCATCMGCGCCACGAGGCATTTGAAMGTCTGCAGCAACCTGGGATGAGAGGTTC 

ATTGGAGTTACTGGMGTGCGCAGACAGCMGAGGMGAGGAGAGGMGAGGCGGCCGCCTTCTCCCGAGCCGAGCACGAAGGT 
5 TTCAGAGGMGCCGAGTCCCAGCAGCAGTGGGATACTTCAAMGGAGAACMGTTO 
ATCTCCACGGATGGCAGAAACGGTGGACACMGCGAMTGGTCMCGGC^ 
CCCCATCCCCTCCCCGACCTCTGATCGT^ 

(^AGACACCTTCGGCCCAGATGGMGGCTTCCTCMTCGGAMCACGAGTGGGAGGCCCACAATM 

CTGGCACMTGTTTATTGTGTCATAMTMCCMGAMTGGGTTTCTACAMGATGCAM 
10 CCACAGCGAGGTCCCTGTGAGTTTGAMGMGCTGTCTGCGMGTGGCCCTTGATTACAAAMGAAGAM 

AAGACTAMTGATGGCMTGAGTACCTCTTCCMGCCAMGACGATGAGGAMTGMCACATGGATCC^ 

CATCTCCTCTGATAMCACGAGGTGTC 

CGTCACCATCACCAGCGAGTCCAGTCCCGGCAAGC^ 

CAAAMGAMTGMCTCCTTTCCTTCACCTCCTGCCCTTCTCTTACCTTTTC^ 
15 CACATTACTCTCTGTGCCTMTGTTCCTCMTGTGGTTGATTTATTTTT^ 

G 

17 

MQPPLDLKQILPFPLEPAPTLGIiFSNYSamPVQKAVLSHTFGGPLLKTKRPVI 

IEMKTRGREPGVREPGDPAPPGSTPTNGIXiVAPRPVSMENGIiGPAPGSPEKQPGSPSPPSIPETGQGVTKGEGGTPAPASLPG 
20 GSKBEEEKAKRLLYCALCKVAVNSLSQLEAH^ 

EVQLKQHISSRRHRIX5VAGKPNPLLSRHKKSRGAGELAGTLTFSKELPKSLAGGLLPSPLAVMVMAAMGSPLSLRPAPMPL 
LQGPPITHPLLHPAPGPIRTAHGPILFSPY 

18 

AGAGCGACGCGGAACCCCGGGCGCCTGGGTCCCCAGCATGATCCTCGGCAGCCTGAGCCGGGCAGGGCCCCTGCCTCTGC 
25 GCAGCCCCCGATCATGCAGCCCCCACTGGACCTCAAGCAGATCCTGCCCTTCCCACTCGAGCCAGCCCCTACCCTTGGCCTCTT 
CAGCMCTACAGCACCATGGACCCTGTGCAGMGGCTGTGCTCTCCCACACTTTTGGGGGACCCTT^ 
CGT(MTTCCTGTMTATCTGTCAM^ 

ACGAGTCAMGGCATTGAGGCTGCCMGACCAGAGGCAGGGAGCCTGGCGTCCGAGMCCTGGAGACCCAGCTCCCCCAGGCAG 
CACCCCMCAAATGGGGATGGTGTAGCACCCCGTCCAGGTACTMGCACMGACMTTCTGGAGGCCC^ 
30 CATCAMGCTTACCCTCGGCTGGGGCCTCCCACCCCGGGGGMCCAGAGGCTCCTGCCCAGGACCGMCTTTCCACTGTGAGAT 
CTGCAATGTCAAGGTCAACTCGGAGGTCCAACTGAAACAGCACATCTCCAGCCGGCGGCACCGAGACGGCGTGGCCGGGAAGCC 
CAACCCACTACTGAGCCGTCACMGAAGTCTAGGGGC^CGGGGAGC 
GTCCCTGGCGGGCGGCCTGCTCCCCAGCCCCCTGGCGGTO 

CCCGGCTCCAGCCGCACCTCTTCTCCAGGGACCGCCGATCACTCACCCTCTGCTTCACCCGGCCCCCGGACCCATCC^ 
35 GCACGGACCCATCCTCTTCTCCCCGTACTGACCTCAACCCTGAACCCCTCCCATTCAACTCCCCACCTCCAGCCGGGACCCAGG 

CGTCCGGACTCCCAGCCCGCCCCTCCTCCCGGCACTCCCTAMTGATCTCTCCTTCCCCCCCCCCACCCCGAGATACGGG 

CAGGAMGGGGAGGGGTAGCGGGGGAGGGGGGCTTCAGMGGGGGGGMCACCCCAGATCTCAGGGMCCCCGCCCCCTGCCTT 

TCCCTCTCCCCTAGAAMGGGGGGGCCGTCTCACCCCC 

GCCCTTCCCCCCAMCCTAGCACAAAACGGGGTTC^^ 
40 ATMGCGGACTCTGGGACTCCGGCTCCCTACCCCAMCTGMGCGCTTCCGTGMCACCCCCGTCCTCCGTAGGGGGAGGGGAG 

CAGGCGGGATCCTGGGTCCCTCATMGCACTTTG^ 

CCAGGTCTAGCCGGGCCCATTGCAGGGGGCAGCACTTGGGGGCATCTCCGGCACTTGGGTGGGACCAAGGAGATGCCACCATAG 
ACCTTTCCCTCGCCTTCTTCCTCCCTAGTCCGGGTTCCATTCTTTTC 
AGGGGTGTCCAGTCCMGCCCACCCCCGCCTCGCCTTO 
45 CCCCTTCGCAGGATTCGCCGAGTCTGTAGCCTCCCCGAT^ 

CACTGCACAACTCGGGCGGGGGTGTGACACCTCTCCCCCACCCCCG 

ACCCGGGAGGGGTCTGGCCTACACTGGTCTTCCCCTTCCCATCMCTCTTTCTGC 

CCACGGCCCTCCCCITmCCTCTCCCTGA^ 
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MYKSLLDKAGLGSITSTOFLGDQQSVFVSKD^ 

SLDPSTQWFSATVVHGNPSSKTLQVNCEEIPALKIVDPALIHVEVW 

SPSMCPVQSVPTTVCKEILLGCTMTPSSNRQQMTQAMSPPNI^ 

LSSKSSQVGAGDLKILSEPKGSCIQPKTNTDQESRLESTPQPVTGLTKECLVTKTSSKAELDNATAPELQKRLEHTASTPDGLS 
5 DKPEVEAGWRLNSCSEKKVGPSDLGSQSQNLKETSVKVDHDSCCTRSSI^TQTPPARKSVLTDP 
VNIVAQLPKCRECRLDSLRKDKDQQKDSPVFC^ 
ILANIGDHFCQmSEKEAMSTIEPHRQVAWKRAVKGV^^ 

SWIRCWSQIHEPENLMPTQIIPGKALYDVGDIVHSVRAKWGIKANCPCSNRQFKLFSKPALKEDLKQASLSGE 
SSPVLEPAAVCGEAPSKPASNWPICPANTSPLNV^AD^^ 
10 • ILTPVSNNNSGFLRNLLNSSTGKTMGLKNTPK^ 

PMKSNVimWRECWKQGQPVMSGVHHKLNSELWKPESFRKEFGE 
mKLKDWPPGEDFRDMMPSRFDDLMM^ 

DAANVMVWGIPKGQCEQEEEVLKTIQDGDSDELTIKRFIEGKEKPGALWHIYAAKDTEKIREF^ 
DQSWYLDRSLRKRLHQEYGVQGWAIVQFLGDWFIPAGAPHQVHNLYSCIKVAEDFVSPEHVKHCFVi^ 
1 5 KLQVKNVI YHAVKDAVAMLKAVNPVLANLNLPAHWK 

20 

GGAGAGCATTTTTAGTAGAACATMTTTGGTTTTAGCTGM 
CMTAACGTACAAACCTCTGTTGGACAMGCTGGTTTGGGATCCATMCTTCTGTTC 
TTCTTTCTAMGACCTTTTGMGCCTATACAGGATGTAM 
20 MTTTCMGCTTTGATTGTGMGCATTTAGATGAMGCCATCTTTT 
TTTATAGCTTGGACCCATCTACTCAGTGGTTTTCAGCMCCGT 
GTGAGGAGATTCCAGCACTGAAAATTGTTGATCCGTCACTGATTCATC 

ATTCTGCAAGMTTGGAGCTGTAAMCGCMGTCTTCTGAGAATAATGGMCCCTGGTTTCCAMCM 

AGGCCTCTCCCAGTATGTGTCCTGTGCAGTCTGTACCTACMCAGTTTTTMGGAGATACTGCTO 
25 CACCTAGTMGGACCCMGACAGCAMGTACTCCCCAGGCTGCCAACTCTCCACCTAACCTTGGAGCAAAAATTCCTCAAGGAT 

GTCATAMCAMGTTTACCAGAGGAMTTTCTTCCTGTCTAAATACAMGTCTGM 

MGCAGGGTTGCTCTCAMGTCCTCTCAGATTGGMCTGGAGACTTGAAMTTCTGACTGAGCCAAM 

CTMGACAMCACTGATCAGGAAMCAGATTGGAGTCTGTTCCACMGCATTC 

AGGCTTCTTCTMGGCAGMTTGGAMTTGCCMTCCTCCTC 
30 TTTCAMTGCACCAGAAGTGAMGCAGGTGTCMTAGTGA^ 

TAGCTTGCCGATCACAGMTTCAMGGMTCTTCAGTAAMGTAGA 

AGMTGCCCCATCCAGGAAGTCGGTTTTGACAGACCCAGCTAM^ 

ATGATTCTOTGTGAACATCGTGGCACAGTTGCCTAMTGCCGAGAGTGTCGC™ 

AGMGGACTCACCTGTGTTTTGCCGCTTCTTTCACTTCAGGAGGTTACMTTCMC 
3 5 TCTTMCACCAAACMGTATGACMTGMGCMTTGG^ 

CAGCAAAGTACATCTTGGCCMCATTGGAGACCACTTCTGTCAM 

CACACAGACAGGTTGCTTGGMGCGAGCTGTCAMGGTGTTCGAGAM 

ACTGGGTGTGTCCTCGGTGTGGGTTTGGAGTATGTGTGGACTGCTACCGGA 

ACMGACTTTCTCTTGGCTAAMTGTGTGMGAGTCAGATACATGAACCAGAGMCTTMTGCCC 
40 MGCACTCTATGATGTTGGAGACATTGTTCATTC 

MTTCAMCTCTTTTCAMGCCAGCCTCAMGGMGA 

TGCTCCAGCAGAATCCCTCAGTGTTGGAGCCAGCAGCTGTGGGTGGGGMGC 

CCTGTCCAGCCAGCACATCTCCTCTAAACTGGCTGGCCGACCT^ 

CMCMTGCCMTTTTAAAGMTGAAATCAMTGCCTTCCACCCCTCCCACCm 
45 TTMCAGCACMTTTTGACACCCGTMGCMCMCMTTCTGG 

AAAATGGACTCMGMTACACCAAAMTCCTTGATGACATC 

AGAGGCCTCAAGGACTMCCATCMGCCCAGCATTCTGGGCTTTGACACTCCTCACTATTGG 
GCTTGCMGACCCCMCAATMGAGCAACTGGMTGTGTTTAGGGAGTGCTGGAMCAA 
TGCATCATAMTTGMCTCTGMCTTTGGAMCCTGMTCCTTC 
50 GTAGGACCMTGAMTCATCACAGGAGCCACAGTAGGAGACTTCTGGGATGGATTTGMGATGTTC 
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AAAAAGMCCMTGGTGTTGAMCTTM^ 
TGATGGCCAACATTCCACTGCCCGAGTACAC 

GGCCAGATCTGGGCCCCMGATGTATMTGCTTATGGATTMTCACTCCTGMGATCGGAMTA 

tagatgtatctgatgcagctmtgtcatggtctatgtgggmttcccamggacagtc 
5 ccatccmgatggagattctgacgmctcacaatamgcgatttattc 

atgctgcamggacacggagmgatmgggaatttcttaaamggtatcagaagagcaaggtcaagaaaacccagcagaccacg 

atcctattcatgatcamgctggtatttagaccgatcattmgaaaacgtcttcatcmgagta 

ttgtacagtttcttggggatgtggtgtttatcccggcaggagctccacatcaggtt 

ctgmgattttgtttctccagagcatgttamcactgcttctggcttactcaggm 
10 atcacgaagataaattacaggtgmgaatgttatctaccatgcagtgamgatg 

gttttggcamccttmtctccctgcacattggamtgmttacaggcagctgttcam 

tttgagattcatgttacctcatcttcttttttamctc 

tacagacagtamtgtgtatatgtagtmctatttacagmcatgcat^ 

taccaggctgtaaaagcaaaacctcgtatcagctctggmcaatacctgcagttattc 
15 ggtttatmctattaggmtcactgcacagtmm 

gmgagcmtggaggmgtgacagctmtgttgcagttcttattc^ 

ctgtacmtttcttggggttmccatctttagttamtggmttttm 

TTTTGCATTAAAATATTCATATAAT 
21 

20 MKAGGGMAGSVSGHSESLASLSRSPQTKAGQKLQHKNYPVCVLYIFITKFMQKPLLNQVILSSVKRN 
22 

CAGMGACTACATTAGTGAGATGTMGMTTATTAMTATTCCATTTCCGCTTTGGCTACMTTATG 
TCTTTTAGACCACCAGTAMTMTCCTCCTTCAAA 

CTAAAAAMCCTAAGGAAGMTMTMTATMGAAAGGAMTTTAAAMCAT^ 
25 " TACTTATGGTTATATCTTATATTCTCTATTCM 
AMTGTTTTCAAAAATTATAGCAGTAGTAGM^^ 

TTGATACGAATGTTATGCATTTAGTATGCACATTGMGTCTAMCTGTAGMGAGTC 

ACATACTMTGGTTTAATTCTGTGCTCTGTTTAMGTACTATTATMCTAGAGTAGATC 

GAGGMTGGMGAATGGACCTTGAMCTACCTAGGCTTTTATGCATGGCACCTCTTTATMTC 
30 GTTTTTGTTTCMTTACCGCTAGAm 

AMTTATTAAAATTCTAGTGTTTGGATTGGGCCCTTCTCTMCAGTACATACTC^ 

MCCATTTGGGTTTCAMTGTTMGMCACTAMTAGCATGATTTAAAAMTGAAAM 

TAAGTGCCTTTTMCAACTCCTAGAGTACAAMTGAGTACATCATMTGCTGGCTCTTCTACTM 

GMTAMTTATTTATCTTCTCAGTTTCCTTATCTGTAM 
35 mCCTTAGCKTTTTATMTGCTTC 

TMTGATTMTGCATTCTCCACATTTTMTATTGCAMGGCCCATTC 

CAMTMCTGTAAAGGMCTGAAMTCTTCACAGAGATGMGTGGGGTTTCCATTAGGTG 

CMCTTCCACTGGTCMTATATAGATTTTGGGTGTCTGAGGCCCCAAGATTAGATGCCACTMTCTC 

TATGAMTATTTOMTGTCTACTmAGAGA 
40 TACCTGGTTCAAAAGTGGTTTTTGTTTATTAMTTTGGTMTAMT^ 

TTTTTGGCCAGOTTTAGTTTGAGGACTCCTTGATMGCTO 

TCCTGCCTTCATTGGGGCAGACTTGCATTGCAGTCTGACAGTMOT 

TCAGTTTTTAAMGTCAMGTTAGATCMGAGMTATTTC 

AMGATTTACACMTAGATMTCATCTTAATGTCAMGATATTTGAAGTA 
45 AGTCTTCTGTATGGMTTTTGTCACTTMGATGAGCTGCAAATAMTM 

AAAAAAAAAA 

23 

MGNFRGHALPGTFFFIIGLWWCTKSILKYICKKQKRTC^ 
GHWNQLLGWHHFTMYFFFGLLGVADILCFTISSLPVSL^^ 
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LEFLVRNmLELLRSSLILLQGSWFFQIG 
RLCSSEVGLLKNAEREQESEEEM 

24 

GMGCACATCTGGACAGCTGTGCGGCCTCCTTGCGGGCCGACGTCAGCCGAGCACGTCCCCCACGTCCTCTCCTT 
5 TATTATTTATTCGTTTTCCCAMGMGCGACTAGGGACCCMGTTTW 

ATCCCATCCMCACACGGTTTMTTTTCATGGGGCTCTGGGATCAAAAGMCA 
CTGATTTTMGCTGGCAMGTGGGAAAMTAAAGTC 
CCTCCCTGGMCCTTCTTTTTTATTATTGGTCTTTGGTGGTC 
MCCTGCTATCTTGGTTCCAAMCATTATTCTATC 
10 CATGGCTGGGGAGCAGTTTATTCCTGGAGGGCCCCATCTGATGTTATATG 
■ CTGGCATCATTTCACCATGTAmCTTCTTTGGGCTGTTGGGTGTG 

GTCCTTMCCMGTTMTGTTGTCAMTGCCTTATTTGTGGAGGCCTTTATCTTCTACAACCACACTC 
GGACATCTTTGTGCACCAGCTGCTGGTTTTGGTCGTCTTTCTC 

TGTACTTCTGGAGCTATTGCGGTCMGTCTCATTCTGCTTCAGGGGAGCTGGTTCTTTCAGA 
15 CAGTGGAGGTCCTGCATGGGATCTGATGGATCATGAAMTATTTTC 
CATTGTCATCGTTGGMTGAATTATGCTO 

ACTTCTGAAAMTGCTGMCGAGMCMGMTCAGMGMGAMTGTGACTTTGATGAGCT^ 
TTCTTTTTTACATTGTTCTTGGTTTTGTTTCTC 
TGCATTTCCMTTTGGTTAAAGTATTTGMTTTAMTATTTTCTTTTTA 
20 CACATCATGCACATCATGGTATTCAGGGGCTAGAGTGATTTTTTTCCAG 

ATATTTGTTTTATTTGCCTTATAGATATGCTCAAGGTTACTGGGCTTGCT^^ 
TGTTTATCTOTTGCTGCMTGAGAAATAMTGM 

25 

MPSLWDRFSSSSTSSSPSSLPRTPTPDRPPRSAWGSATREEGFDRSTSLESSDCESLDSSNSGFGPEEDTAYLDGVSLPDFELL 
25 SDPEDEHLCANLMQLLQESLAQARLGSRRPARLLMPSQLVSQVGKELLRLAYSEPCGLRGALLDVCVEQGKSCHSVGQLALDPS 
LVPTFQLTLVLRLDSRLWPKIQGLFSSANSPFLPGFSQSLTLSTGFRVIKKKLYSSEQLLIEEC 

26 

GCAGCAGGCCMGGGGGAGGTGCGAGCGTGGACCTGGGACGGGTCTGGGCGGCTCTCGGTGGTTGGCACGGGTTCGCACACCCA 
TTCMGCGGCAGGACGCACTTGTCTTAGCAGTTCTCGCTGACCGCGCTAGCTGCGGCTTCTACGCTCCGGC 
30 CAGCAMCGCCCTGGCGTCTGTCCTCACCATGCGTAGCCTTTGGGACCGCTTC 

CCTTGCCCCGAACTCCCACCCCAGATCGGCCGCCGCGCTCAGCCTGGGGGTCGGCGACCCGGGAGGAGGGGTTTGACCGCTCCA 
CGAGCCTGGAGAGCTCGGACTGCGAGTCCCTGGACAGCAGCAACAGTGGCTTCGGGCCGGAGGAAGACACGGCTTACCTGGATG 
GGGTGTCGTTGCCCGACTTCGAGCTGCTCAGTGACCCTGAGGATGMCAC1TCTGTGC 

GCCTGGCCCAGGCGCGGCTGGGCTCTCGACGCCCTGCGCGCCTGCTGATGCCTAGCCAGTTGGTMGCCAGGTGGGCAMGM 
35 TACTGCGCCTGGCCTACAGCGAGCCGTGCGGCCTGCGGGGGGCGCTGCTGGACGTC 

GCGTGGGCCAGCTGGCACTCGACCCCAGCCTGGTGCCCACCTTCCAGCTGACCCTCGTGCTGCGCCTG^ 

CCMGATCCAGGGGCTGTTTAGCTCCGCCMCTCTCCCTTCCTCCCTC 

GAGTCATCMGMGMGCTGTACAGCTCGGAACAGCIH^TCATTGAGGAGTGTTGMCTTCM 

CTCCAAGACAGAGACGACTGMCTTTTGGGGTGGAGACTAGAGGCAGGAGCTC 
40 GCAGCCACCTMGGTGGAGGTGGGGGMTAGTGTTTCCCAGGAAGCTCATTGAGTTGTGTC 

CATACCCCTCAGTACTGTAGCATGGMCAMGGCTTAGGGGCCMCM 

TTATTTTTGTTACTGACAGTTAACAGTGGTGTGACATCCAGA6AGCA 

GGMGAGGCACGTGCTCCTCAGAGCAGCCGGAGGGAGGGGGGAGGTCGGAGGTCGTGGA 

GMGGGACCMGTGTGTTTGTTGTTTGTTTTGTATCTTGTW 
45 CTTACAGACGCATGMTGTMGAGTAGGMGGGGTGGGTGTCAGGGATCACTTGGGATC 

GCAGCTGCGTTTMGCCTTCCCCCATCGTGTACTGCAGAGTTGAGCTGGCAGGGGAGGGGCTGAGAGGGTGGGGGC 

CTCCCCGGGAGGAGTGCCATCTGGGTCTTCCATCTAGMCTCT 

ATGTTCMGTATTMGACCTATGCMTATTTTTTAC 
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GGPGGPECPMRTPQULLQVFFLVFPOSWPQPSSSPSGAVPTSL^^ 
PGNRTVDLFPVLPICVCDLTPGACDI^^ 
FC^VNNSNLNYFQKLQKVmTNFQALMEFGGES^ 
PAGFLESKSTTCTRFFKNLASSCTLDSALNMSYYNFTVLKW^ 
5 IETNGTFGIQKVSVSLGQTNLWEPGASLQQHFILRFRAFQQSTMSLTSPRSGNPGYIVGKPLLALTDDISYSMTLLQSQGNG ' 
SCSVKRHEVQFGVNAISGCKLRLKKADCSHLQQEIYQTLHGRPRPEYVAIFGNADPAQKGGWTRILNRHCSISAINCTSCCLIP 
VSLEIQVLWAYVGLLSNPQAQVSGVRFLYQCQSIQDSQQOTE^ 
GVFSQKCSVSPILILCLLLLGVLNIiETM 

28 ' 

10 ggggggacccggaggccctgmtgccccatgcgcaccccacagctcgcgctcctcx:aagtgttctto 

cgtccggcctcagccctcttcctccccatcaggggcagtgcccacgtctttggagctgcagcgagggacg 

ccagtccccttcagaggcgactgcmctcgcccggccgtgcctggactccctacagtggtccctac^^ 

ccctgggaataggactgtggacctcttcccagtcttaccgatctgtgtctgtgact^ 

ctgctgcgacagggactgctatcttctccatccgaggacagttttctcct^ 
15 ggtttgtgtagacmctctgttatcttcaggagtmttccccgtttcc™ 

gttttgtgtccatgtgmcaactcamcttamctatttccagmgcttc 

tgcagagtttggaggcgmtcattcacttcmcattcc 

tacttacttccccmgtggtctgtaatmgcttgctgagacaacctc 

tcctgcaggtttcctagagagtaaaagtacmcttgcactcgtttt^ 
20 cctcmtotgcctcttactatmcttcacagtctt^ 

tcctgtmtacttacctcacaggctmtgctcctctgttggc 

gatagagaccmtgggacttttggmtccagamgtttctgtc^ 

cttacagcaacaotcatccttcgcttcagggcttttcmcagagc 

tggctatatagttgggmgccactcttggctctgactgatgatatmgttactcmtgaccctc™ 
25 aagmctctgttaamgacatgmgtgcagtttggagtgmtgcmtatctggatgcmgctcaggttgmgmggcagactc 

cagccacttgcagcaggagatttatcagactcttcatggmggcccagaccagagtatgttgc^ 

agcccagaaaggagggtggaccaggatcctcaacaggcactgcagcatttcagctatam^ 

agtttccctggagatccaggtattgtgggcatatgtagg^ 

ataccagtgccagtctatacaggattctcagcmgttacagmgtatctttgacmctcttgtgaagtttgtgg 
30 gaagccacagcctccmggggccmcccaamtggactggaaatggccattcgac 

aggagtattctctcaaamtgctcagtctctcccatcctta^ 

gtgmgaaaagaamtmtcagatttcagttttccctatgagamctctgaggcagccactta 

cctgctcatgaccagagagcatttaggataatagaggacctmctgaa 

gcmtaaamtattttattcatcatagctctctgcttto 
35 ctgggmggccmgagccttccttcagcctttctggttatgttacacctagctgaatgttt 

tcaggcacagttgggccmgcagamgagagamctcttctgctgra 

gtagaggagctaactgtttggmcagaamctgctggctgttgattttgtctg 

gcccagacacgagtggggamgcagttctttctcctcagtttccamgtamtgggg 

atgaccctaccatttatttctgcctttttcttc 
40 aaaaaaaaaaaaaaa 

29 

RSHFICDLDFFVLHYIFIDFVINMVLGKVKSL^ 
SRNAGSNTAYTQNYTEEVEYFNHKDILIGHER^^ 

WLLPVKLKKEFTWEHIDINTPSLLSPQAGTKEKTLCCWFCTSGPISLSMIERKGYTPGESIQIFAEIENCS 
45 QTQAFYAKGKMKEVKQLVANLRGESLSSGKTETWNGKLLKI P PVSPS I LDC S I IRVEYSLMVYVDIPGAMDLFLNLPLVIGTI P 
LHPFGSRTSSVSSQCSMNMNWLSLSLPERPEAPPSYAEVWEEQRRI^LAPVSACDDFERALQGPLFAYIQEFRFLP 
DPNPDQSADDRPSCPSR 
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GACCACTGAGACGAGCGGGAGCGCGGAGCAGCAGCCTCTGCTGCCCTGACTTTTTMGAMTCTCM 

ATCACTGATCCGGCCTGCAAGCATTTTGCACGGCAAAMTATCGATCAGTC 

ACTTTTTTGTCTTACATTATATTTTTATAGATTT 

ACTGTCTTMTGACAGCMTGTCCCTGTGTATTCTAGTGGGGATACCGTCTCAGGMGGGTAM 
5 TCAGAGTAAMTCTCTTAAMTTCATGCMGAGGACATG^ 

cctatacacagmttacactgmgaagtagagtatttcmccatamgacatcttmttgggcacgamgagatgatgatmto 
ccgmgmggcttccacactattcattcaggmggcatgmtatgcattcagcttc 
cattcgmggccgacatggcagtgtgcotattgggtgamgccgaat^ 
aggmmacagtctttgagcatatagatatcmca 
10 gttgctggttctgtacctcaggcccmtatccttmgtgccaamttgamgg 

TATTTGCTGAGATTGAGMCTGCTCTTCCCGAATGGTC^ 

GGAAMTGMGGMGTAAAACAGCTTGTGGCTMCTTGCGTGGGGMTCCTTATCATCTGGAMGA 

AGTTGCTGAAMTTCCACCAGTTTCTCCCTCTA^ 

ATATTCCTGGAGCTATGGATTTATTTCTTMTTTGCC 
15 CAAGTGTMGCAGTCAGTGTAGCATGMTATGMCTGGCTCAGTTTATCACTTCCTGAAAGA 

CAGMGTGGTMCAGAGGMCAMGGCGGMCMTCTTGCACCAGTGAGTGCTO 

CACTGTTTGCATATATCCAGGAGTTTCGATTCTTGCCTCCACCTCTTTA 

ATGATAGACCATCCTGCCCCTCTCGTTGMGGMCACTTGGTTGMTCMGTTGATGTGGGT^ 

TGAGGACAGAGMGTATCTTGGAGACACGTTTCAGAGGAAGTGGMTTACTTTTGCCCAGAA 
20 CCAGTGATCATGCTTTAGMGCCTACAGCMCATTCTGAGACTGCTCCAACATGCTTGAAGATCTM 

CTGGCACATACTCAGAGCAGTCTTCTTAGCCTATGGTCGTACGTGTC^ 

TTGATTTGAAMTTTGCACATGCTCAATGCTTACATTC 

CTATTTTTGCCAGACTCTTGATACTCTTAAMCTTGTTTGTGGTCAG 

MCATGAAAAMCGCACTGACATTTTTTTTTATTTMTATAGCCTGGACT 

25 taggctgactatgtatcacctcttcagcttggatccmttgt 

ttgctgtatgttttgcagcctactgtagtagatacgcaacagatmtc 

agactgtcmgattgtataccttcttggtttcttttmgmtttgttgcct 

tgactgcctaamtcacttaatctcaggtgaacgcatcacttgccam^ 

ttttcgtttgtttgtttgtttatttc^ 
30 cattccctttttatcatgacatacaagaagaaactagcagagctaagaatggagtgaagaaaggcagtatggcaggcaccagca 

mgagttgagggctgttgctcttaaamttattttttttattattattttgam 

gagggaamgtgcatttatttttatacaga 

agtatattmtgagcaggmtacatacattgaggttatgaatagagagctcmtttgtaccto 
tatggcatgaamctcgacttoattccaamgtmcttcaa 
35 ttttgtgtttttctaatgagmtactgtttttcattacctamgm 
gtatagattacataggmgaacmtcacatcagtmgttatagtttat^ 

aggmttcagctctgatgtttttamgaamccagcatctctgatgttc 
gccatttmccccacagttmtamgtggctgaamtaatagt^^ 
40 agctcatacaamcaaataggcttttccatmgtggcctttmgaamcatggmgag 
ggtgmgamgcccagtgtaaamtgmtcgcgttttmgtgattcggttamgagtttgg 
gatmtmggamtgggggtgamtatttttttattgttgaatcattt 
ttggcatamgggcatttggtggttttattto 

tagggmcmttgttgatatgcatagcattggmtacttgtcattatatactcttacamtm 
45 atattctgatmttggcactggatcacaamtgtc^^ 
ctttaamtgtatttctttagaggcattacttttttaaam 
ttccagtattcaagatagattcctcatttttc 
tgtttgctttatagattatattctatggctgtttgtmtttc 
ggctctgttttmgaamcaatatgtgggamtgatttmtttto 

50 TTTTCTGTTTTGTAT 
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31 

MLGKGGKRKFDEHEI&LEGKIVSPCDGPSKVSTO 

MFTPSSQPTTEPSDSYREAPPAFSHLASPSSHPCDLGSTTPLEACLTPASLLEDDDDTFCTSQAMQPTAPTKLSPPALLPEKDS 
FSSALDEIEELCPTSTSTEMTMTDSWGTSSEAGTQKLDGPQESRADDSKLMDSLPGNFEITTSTGFLTDLTLDDILFADID 
5 TSf4YDFDPCTSSSGTASKMAPVSADDLLKTLAPYSSQPVTPSQPFKMDLTELDHIMEVLVGS 

32 

CTCCTGCACGGCGAGTGCTGGAGCACGAGCTACCGCTCGCTCGGTCAGG^ 
CTGCCTGCCGCCGCCGCCTCCACCATTGTATAATGCTCGGGGCGC^ 

TGCTGCCCGCTCCCCGGCGCCACCCTCGGGCCCCTGGAGCGGGGCACTCC(X:ATGGAGCGGGAGTA^ 
10 ACCCCTCCTGATGCGTTAGTTCCCAGGTGGAGCTC^ATGTGATATATCTTGGGT 

GMGATGGGCTGGMGGCAAMTCGTGTCTCCCTGTGACGGTCCATCCMGGTGTCTTACACCTTACAGCGCCA^ 
AACATTTCCCTTATGAAACTCTA^ 

AGGCGGATCCAGGAGGMCTCAMCAGGMGGCAGCCTGAGGCCCATGTTCACCCCCTCCTCCCAGCCCACCACCGAGCCCAGC 

GACAGCTACCGAGAGGCCCCGCCGGCCTTCAGCCACCTGGCGTCCCCGTCCTCCCACCCCTGCGACCTCGGMGCACTACG 
15 CTGGAGGCCTGCCTCACCCCGGCCTCACTGCTC^ 

. CCCACCAMCTGTCACCTCCAGCCCTCTTGCCAGAAMGGACAGTTTCTCCTCTGCC 

ACATCTACCTCCACAGAGGCGGCCACGGCTGCGACTGACAGTGTGAMGGGACCTCCAGCGAGGCTGGCACCCAGAMCTCGAC 

GGTCCTCMGAGAGCCGCGCAGATGACTCAAMCTGATGGACTCTCTGCCT^^ 

CTGACAGACTTGACCCTGGATGACATCCTGTTTGCTGACATTGATACGT^ 
20 GGGACAGCCTCAAAMTGGCCCCTGTGTCTGCCGACGACCTCCTCAAMCT^ 

AGTCAGCCTTTCAAMTGGACCTCACAGAGCTGGACCACATCATGGAGGTGCTTGTTGGGTC 

CTATGCCCACCCAGACCCCAGAGCGTTCCCATMC^ 

GAAAMGAAAATTTTACMCAGGATCACACTAGTTTTTGCTTT^ 

MTACACTTTTTTGAGTGGTTCCTCAGAGACCTAC^ 
25 ATGACAAAMTAMCAGTTCMGTGMGCACMGGATTMGTTGG 

CTATMGTTTTATTGCMGAGGTAMGMGAAMCTATATATATATATCTTATTO 

TOCCCTGTATAGGTTGACmGCMTTCGGCCTTTTTAGAGGCATTMCTACTCCTCGTMG^ 

TAGAAMCTGCTGCCCAMTTTATTTTATATTTTTGTACAGATTC 

TOTACATATGAGATAGCTATTTTGATAGGATTTGCTCACATAGTTCCTGCAMCTTCAGATGTACMG 
30 TATAGAGTTGTMTGTTTTATATGTGTATGGTGCMGAGAAMTTGGATCAMTCMTC 

CACAGGCACACACATGCACACACCCATAMCACACACACAGTGCTTTMGAAAGGGCCAGGTGATATCACACCCAAATTTCACA 

AGCACTGACCCCCTGGCACCAACACCCGCCAGTACTGTGACTTCCAAAGCCAGAGCCACATGTGCTCATCAMCTTGCATTAAG 

CAGTTGGCGGGAGATGGCTGTGGAGCTGGGGGTTTMGTGATGGTTCTCTTTTGCTC 

TTCTTMGAGTGTATTTATGCCAAGTTTGCGCTTTTMT 
35 GGCATMTTTTTATGATGACCTGAMTTTTACATCCGMCAAM 

GCCCTGTTGCAATGCTTAGGGCCCTTAMGMGAAMTCTCCCCAGMGGCATCCATCATGTTGCTO 

TCCTTTCCOTAGAGCTTTCCCTGTGTTGCTMGAGCTGAAMTGGCATCTTCG 

CCGGCCCGGGATGCACTCTTACMCATGTGTGACTCTTGMCCTGGAGTTCATCACATTACGTCACAGCTTCCCA 
TTTCCTGAGTCAGCTACTTCACACTTGTCMGGCTGTTTTACCCCAAMC^ 
40 CCCCCCMTTCCCCCCCCATCACCTTATCTCCCAGGACACACTTGAGMGTA^ 
TAAAMGGTTGCMTGTATCATGCTTGTTGCCGAMCTGTTTATGGCCTTC™ 
TTTAGCTGmAGTGCAGGTTACAACCTATATTGmTGCAGATGGCTTC 
TTAMTTATGGGATGTTTTGTTGTTGTTGTO 

TGCCATGGATAAMTTGGGTCTTTCTGGCTMTTAAAAAAGACAACT^ 
45 TATTTTTGTMTGCACATGGCAMGCAMGACGTTTC 
TAMGGCTTTCTACATTCTMTTTACTTTTTCCCCC 
GAMCTGATCAMTTGCACTTGTTCTCCTACAAGCMCCTO 

ATAGGACCAGGGACCATGTMCTGAGGTGAGGGTTGTAGTAGATGCTTCCAGTGTCAGTATGCCTC 
CmCTTGCAGAGAACMGTCTGCCCAGATTCCATGCTTTCTATMCTGGA 
50 CTACCTACGTACACATATACMTAGTATTGATGATTCTGAACAATAACAGGGT^ 
ATTTACAGTMCTTACMCMCTGTACTTTC 
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CAMTMGCACGGAGMGTGTOCCCAMC 

MCCCCAAAMCAGCAGGCAMTGCTTTTTMCTCTGACACCGTTGCCATAM 
ATGGAAMTTAGCAGCCCATTTTCAGAMGATCAAAATGATCTAGGGTTCTMTTGCTTTTGCATC 
GTCCCMCAGGGMCAGTAGGAGCTGGAGTGGGATCTCCAAGTCCCAGTTTGAGTGTGGGATGTGOTC 
5 TTTATGAMGACATCACATGGCATCCAGGGCCAGGCAGGCAGCTTGAGGTG 
GGACAGTTATTGACACTGATGTGCMTGM 
' CCCCCCATMGTTATTTATTCCTTTT^ 

CAGAATGTATGGTAGGMTGTATTCTCTTGTAGGMTGTAMTCTC 
CATTGTATGCACAGTCCGCATTCATTTTTACTCTTCTCTAATATGGGTCTATT^ 

1 0 mtacctccamtttttmgaamgcatcam 

ggggcaaccacatgggcacccttgttcataccamgggtgagcagtggccagagcctcctctgcacctctcgagtg 

mttgagctm^atggccatagccccttggagtgccccagctgccctgaggtcmtcmggaam 

tccamgagccamgtatcmcttacagatcgtttttamgcttamtttatgmcca 

ataccgcagggcagccttcttamtgacamtgtaaaaaaaaaaaaa 
. 1 5 gmttgtcttmtttgaamccttgctgttacamttggac 

ttttctggctgtaaatggttaccttcctc^ 

gttttcagtgtttctmgmtgagtctgmtggttcttgaam™ 

agttggacttcttttggggataacmgttttggm 

tacatagtgacatgtttggtgcatcgtttttgaggagggcttt^ 
20 gctgtgccttttctatgcaatattacagacgttacatcggmcccagatggctgtattc 

mcmttggacagattamtgtacatggamtgagcagtctocttttgtagttttata 

33 

MTSRDNYKAGSREAAAAAAMVAAAAAAAAMEPYPVSGAKRKYQEDSDPERSDYEEQQLQKEEEARKVKS 
AKIEARIDEWSRAEKGLYNEHTVDRAPLRNKYFFGEGYTYG 
25 GmSAVINDYQPGGCIVSHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLP\^GSVTVLSGYAADEITHCI 
RPQDIKERRAVIILRKTRLDAPRLETKSLSSSVLPPSYASDRLSGNNRDPALKPKRSHRKADPDMHRPRILEMDKEENRRSVL 
LPTHRRRGSFSSENYWRKSYESSEIX^SEAAGSPARKVKMRRH 

34 

AAGTGTTACTTCTGCTCTAAMGCTGCGGMTTCCTCGAGC^ 

30 AAGTCCATGACGTCCCGGGACAACTATAAGGCGGGCAGCCGGGAGGCCGCCGCCGCTGCCGCAGCCGCCGTAGCCGCCGCAGCC 
GCAGCCGCCGCTGCCGCCGAACCTTACCCTGTGTCCGGGGCCAAGCGCAAGTATCAGGAGGACTCGGACCCCGAGCGCAGCGAC 
TATGAGGAGCAGCAGCTGCAGAAGGAGGAGGAGGCGCGCMGGTGMGAGCGGCATCCGCCAGATGCGCCTCTTCAGCCAGGAC 
GAGTGCGCCAAGATCGAGGCCCGCATTGACGAGGTGGTGTCCCGCGCTGAGMGGGCCTGTACAACGAGCACACGGTC^ 
GCCCCACTGCGCAACAAGTACTTCTTCGGCGMGGCTACACTTACGGCGCCCAGCTGCAGM 

35 CTCTACCCGCCGGGCGACGTGGACGAGATCCCCGAGTGGGTGCACCAGCTGGTGATCCAAMGCTGGTGGAGCACC^ 
CCCGAGGGCTTCGTCMCAGCGCCGTCATCAACGACTACCAGCCCGGCGGCTGCATCGTGTCTCACGTGGACC 
TTCGAGCGCCCCATCGTGTCCGTGTCCTTCTTTAGCGAOT 

GTGTCGGMCCAGTGCTTTCCCTGCCGGTGCGCAGGGGMGCGTGACTGTGCTCAGTC 
TGCATACGGCCTCAGGACATCMGGAGCGCCGAGCAGTCATCATCCTCAGG 
40 MGTCCCTGAGCAGCTCCGTGTTACCACCCAGCTATGCTTCAGATCGCCTGTCAGGAMCMCAGGGACCCTG 
MCXTGGTCCCACCGCAMGCAGACCCTGATGCTGCCC 

GTGCTGCTGCCCACACACCGGCGGAGGGGTAGCTTCAGCTCTGAGAACTACTGGCGC 
TCTGAGGCAGCAGGCAGCCCTGCCCGAAAGGTGAAGATGCGGCGGCACTGAGTCTACCCGCCG^ 
ATCCTTACGTAGTTGCCCCTCCTTTTGTTTTGAGGGTTTTG 
45 TTTGATTCTATATATTTTTCCT 

TGGTATTCCTCeTCGATGGAMGGCTGTTGGCATCMTAGGGGACAGAGGCTGA 
AGAGCAGCCATCTTTTMGTGGGGCTGTATCAGGCTGGGTTTATTTAAMGCAACAAM 
CTTTCAG'TGTAMTCTTCGCAGTGTTCTAMCAAAGTTCAGTCTTCTGCTCGC 
GAGGTCTCCTGGAGCCTCACAGGCTCTGCTGTTCTCCACTTCTCACCTGCCATCCACGCC 
50 TCTTCCTCCTGCGGCAGAGTTGTTCAGGTTGCCTGGGCAGGGGCTTAM 
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AGCCCCCCAGGCGTCCTCCACCCACGCCCACTAGCCTGCCATGTCCACAGTTCCTTC^ 
CTGACCTCTCTTATCMGAGCACACTTCTTTGCTGGTTGCTCCTTTTGAGCATATC 
GCCACGTTGGGTCAGTTTTAGAMTTGTTTCTAGCTAGAGGGACTGGTGTCCTTCCMG 
GTTGTGGTGTGTGGTAGGGTTTOTCTTTTCTTTm 
5 TTCTCCACTGGCCAGCTTGGGCCTCATCCTCATGTCATCCTTCTAGGAAGGCGCCTC 
TCCMGGCCAGAGCTCAGGCCTGCAGACTGGGCTGGTGCCTC 
AAAAAAAAAAAAAAAAAAA 

35 

MMTVNLELDPIFLKAI/SFLHSKSKDSAEKLKALLDESLARGIDSSYRPSQKDVEPPKISSTKNISIKQEPKIS^ 
10 KVLTTEKVKKEAEKRPADKMSDITEGVDIPKKPRLEKPETQSSPIW^ 
MVASGNQLVECQECHNLYHRDCHKPQWDKEANDPRL^ 
LKQETTFLAFKRTEVKTSWISGNSSSASVSSSVTSGLTGWMFMKTSSA 

ATSSKGGIGSKIGSMSTTPTVPLKPPPPLTLGKTGLSRSVSCDNVSKVGLPSPSSLVPGSSSQLSGNGNSGTSGPSGSTTSKT 
TSESSSSPSASLKGPTSQDHSSML 

15 36 

gtgagttgaccagagmggatgagagmttgccmgttgtactgmcgccmgctgagmtggtgato 
. tttgcmtggctgctactgtgmcttggmcttgatc 
tctgctgaaaagctaaaagcactgcttgatc 

gagccacccaamtttcmgcacaaaamcatttccattmgcmgagcccaamtatca 
20 mtggcmggtcctcacmctgaaaaggtamgmggmc<:tgaamgagacctgctgataamtgamtcagacatcactgaa 
ggagttgatattccamgamcctagattggagamccagamcacag 
cctatggctgacctttccagttttgaggagaccagtgctgatc 

camtgatggtggcatctggcmtcmttagtagmtgtcaggagtgccataatctctaccaccgagattgtcataaac^ 
gtgacagacmggaagcgmtgaccctcgcctggtgtggtattgtgcccgatgtaccagacamtgaam 
25 actcagamccaccgcagamccagcccctgcagttgtttctgtmctccagctgtcam 
actamctgamcmgagacaacttttctagcgtttmgagmcagmgtcm^ 
agtgccagcgtttcctcgtcagtmctagtggcttmctggatgggcagcttttgcagccaaaac 
acagcaamttgagttcaacmcacaamcmtactgggamcctgctacttcgtc 
ggtctggcmcatcatccamggtggmtaggttccaamtaggttccmtmcagcactacgcccactgtacctot 

30 CCTCCACCTCTMCCTTGGGTAAAACTGGCCTTAGTCGCTC 

agtagtttagttccaggmgcagcagccaactmgtgggmtggamtagtggmcatcaggacctagtggmgtact^ 
aamctacttcagaatccagcagctctccctcaggatcccttaaaggcccmcttcacmgatcaca(x:tc 
gattacagatggtcmgaagamgctgcccaaaagamctcmgmgtmtgtggccmg 
amgatgamggcttattattatgatatmot 

35 AA 
37 

vdsamellfiiotfkhpsaeqsshidvwfpcvwinevrvippgvrahsslpdnrayg 
lgsleydentsiifrpnskwtdglvlrgw^^ 

ngspprpqprgprtppgppppdddeddpvplpvsgdkeedaphredyfepispdrnsvpqegqysdegeveeeqqeegeededd 
40 vdveeeedededdrrtvdsipeeeeedeeeegeedeegegd^ 

dpydrelvpllyfscpykttfeieisrmkdqgpdkensgai^ 

tkqdsi£qlvdwtmqalnlqm 

ldsvismtegmeaflrgrqneksgyqkllelill^ 

daglertnpeyenevgasmdmdllessnisegeierli^ 
45 ylhshhflelvtlllsipvtsmpgvlqatkdvlkflaqsqkgllffmseyeatnlliralchfyd 

lwlqdstqtlqcitelfshfqrctaseetdhsdllgtlhnlylitfnpvgrsavghvfsleknlq 

kksvaynyaci lilwvqs s sdvqmleqhaasiilklckadennaklqelgkv^leplknlrfeinc i pnlieyvkqnidnlmtpe 

gvglttalrvu:nvacppppvegqqkdi^ 

TLLKTMLTELLRGGSFEFKDMRVPSALVTLHMLI^ 
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LKEVLSSILKVPEGFFSGLILLSELLPLPLPMQTTQVIEP 

RICVQLCDLASPTALLIMRTVLDLIVEDIiQSTSEDKEKQYTSQTTRLLALLDALASHKACKLAILHL 
LLALVRSPGDSVIRQQCVEmSII^SLCDQDIALILPSSSEGSISELEQ^ 
CVRTMMFLAEHDYGLFHLKSSLRKNSSALHSLLKRWSTFSKDTGELASSFLEFMRQILNSD^ 
5 MSimELKQLLQSKEESPENLFLELEKLVLEHSKDDDNLDSLLDSWGLKQMLESSGDPLPLSDQDVEPVLSAPESLQNLFOT 
RTAYVLADVMDDQLKSMWFTPFQA^ 

FITSSGKSEYIEPMRAHWPPPRGRGRGGFGQGIRPHDIFRQRKQNTSRPPSMHVDDFVAAESKmPQDGIPPPKRPLKVSQ 
KEQEAPVGVLRILLEEITMKVVEARAILTEALFHHYDPLVLQVTAQVLGWLLEWGDLDLPGLVQIMVEAQEE 
WMYAPLHDKNPFGNI LTVYEHFTRTIKIRH 

10 38 

GGTGGACTCGGCGATGGAGCTGTTATTTTTAGATACTTT^ 

TTTTCCATGTGTGGTmTATCMTGMGTCCGAGTCATACCCCCAGGAGTAAGAGCCCATAGCAG 
ATATGGAGAGACATCTCCCCATACATTTCMmGACTOTTCTTCMCMTGTAAGCAM 
GTTGGGMGCCTGGAATATGATGAGAATACTTCCATCATCTTTAGAC 
15 AGGCTGGTATMCTGTCTGACACTGGCMTATATGGATCAG 

ACCTCCACCGCCACCACCTCCCCAGCCACAACCAAGTTTGAAMGGMTCCAAMCATGCTO 

TMTGGMGCCCTCCMGACCACAGCCMGGGGACCMGMCTCCTCCAGGACCCCCTCCACCTGATGATGATGAAGATGATCC 
TGTGCCTCTGCCAGTGTCTGGTGACMGGMGAGGATGCTCCTCATAGAGMGATTACTTO 

TTCTGTTCCCCAGGAAGGGCAATATTCTGATGAAGGAGAAGTAGAAGAGGAACAACAAGAAGAAGGAGAAGAAGATGAAGATGA 
20 TGTGGATGTAGAGGMGMGAGGATGAGGATGAGGATGATCGACGMCAGTAGACAGTATTCCTGAGGAGGMGAGGMGATC 

AGAGGAAGMGGTGMGAGGATGAAGMGGTGMGGGGATGATGGTTATGMCAAATTTC 

CTTGGMCGTGAAACATTTAAGTATCCAAACTTTGATG 

TGATCCATATGACAGGGAGCTTGTACCACTCTTATACTTCAGTTGTCCAT^^ 

GMGGATCMGGTCCAGATAMGAAMTTCAGGGGCMTCGAAGCCTCAGTGAAGTTMCAGMCTC 
25 AGATAGAGGTGCAAMTGGGTMCAGCTTTAGMGAMTTCCMGTTT^ 

CACAAAACMGACTCCCTTGGCCAGTTGGTAGACTGGACCATGCMGCTTTAMTTTACMGTAGCGCTTCGCC 

CTTAMTGTTCGACAGCTCAMGCTGGGACCAMTTAGTGTC^ 

ACMGCAGGAGTGATCAGTGGATTATTTGAACTTCTGTTTGCTGATCACGTATCATC 

TTTGGACAGTGTCATTAGTATGACAGMGGAATGGMGCTTTTTTMGAGGTAGGCAGM 
30 TCTGGMCTCATACTTTTAGATCAGACTGTGAGGGT^ 

CTTGTCAGAGATTAAMGACTTGGTGACCATTTAGCAGAGMGACTTCATCTCTTCCTMCCAC 

AGATGCTGGACTTGAGAGMCAMCCCAGMTATGAAMTGAGGTGGMGCTTCT^ 

TATAAGTGMGGGGAMTAGAMGGCTTATTMCCTCCTAGMGMGTTTTTCATTTM 

CCAACMCCTGTTMGTCTTTCCCMCGATGGCACGMTTACTGGACCTCCAGAGAGGGATGATCCATACCCTC 
35 ATATCTTCACAGTCATCACTTCTTGGAGTTGGTTACCTT^ 

AGCCACAAMGATGTTTTGMGTTTC1TCCACAGTCACAGAAGGGTCTTC 

ATTGATCCGAGCTCTGTGTCACTTTTATGATCMG 

CTOTGGCTACAGGACTCMCACAGACATTGCAATGTATTACAGMCTGTTCAGCCATTTTC 

MCAGACCATTCAGATCTCTTGGGMCCCTGCACAATCTTTATTTGATTACTTTTM 
40 TGTTTTTAGTCTGGAGAAAMTCTCCAMGTCTTATTACTCTM 

TMGAAGTCAGTAGCTTATMTTACGCATGCATACTTATTTTGGTGGTGGTTCAGTCTTC 

ACATGCAGCATCTCTCTTGMGCTTTGTAMGCAGATGAAMTMTGCTAAATTGCAAGM 

GAAAAACCTTAGATTTGAAATTMCTGCATCCCAMCTTMTTGA 

AGGAGTTGGCCTTACCACTGCCTTACGTGTTCTCTGTMTGTTGCATGCCCACCAC^ 
45 GAMTGGMTCTTGCCGTTATTCAGCTTTTTTCTGCTGMGGAATGG 

TCTGACTCAGCCTTGGAGGCTCCATGTCMCATGGGGACTACCCTTCACAG^ 

CACTCTTCTTAAMCTATGTTMCGGAACTCCTGAGAGGTGGA 

TACTTTACATATGCTCCTGTGCTCTATCCCCCTCTCAGGTCGTTTGGATAGTGATGAACAGAAM 
TATTTTACTGACTTTTACACMGGAGTTMTGAAAMCTCACMTCTCAGM 
50 GTTAAMGMGTTCTTTCTTCMTCTTGAAGGTTCCTGAAGGAT^ 

TCCATTGCCCATGCAMCMCTCAGGTTATTGAGCCACATGATATA^ 
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CCTTCATGTTCMGCAAAGTTGCTO 
GCGTATTTGTGTTCMTTGTGTGACC^ 

CTTGCAMGCACTTCAGMGATAMGAAAMCAGTATACTAGCCAMCCACCAGGTTGCT^ 
ACACAMGCTTGTAMTTAGCTATTTTGCATCTAATTMTGGMCTATT 
5 TCTTTTAGCTTTGGTGCGGTCTCCTGGAGACAGTCTTATTC 

CTGTGATCAGGACATTGCACTTATCTTACCMGCTCTTCTGMGGTTCTAm 

AMTAMGMTTGATGACCTCMTCTGTGACTGTCTGTTGGCTACGCTAGCTMCTCTGAGAGCAGTTACM 

ATGTGTCAGMCAATGATGTTTCTTGCAGAGCATGATTATGGATT^ 

TCTGCATAGTTTACTGAMCGAGTGGTCAGCACATTTAGTMGGACACAGGAGAGCTTC 
10 ACAMTTCTTAACTCTGACACMTTGGATGCTGTGGAGATGATMTGGTCTCATGGMGTA^ 

GATGAGTATTMTGCTGCAGAGTTAAMCAGCTTCTACAM^ 

GCTTGTTTTGGMCATTCAAMGATGATGACMTCTGGATTCm 

ATCAGGTGACCCTTTACCTCTCAGTGACCAGGATGTAGAACCAGTACTTTCAGCTC 

TAGGACTGCCTATGTGCTTGCTGATGTCATGGATGATCAGTTGAAATCTATC 
15 TACAGATCTGGATTTGGTAMGGTTGACTTMTTGMCT 

GCGCTCATTTTTGTCAGMCCATCATCTCCAGGMGMCCMGACTACTAAAGG 

CTTTATMCGTCMGTGGAAMTCTGAATACATTGMCCTGCCAAMGAGCTCATGTO 

GGGAGGATTTGGACAGGGTATACGACCTCATGATATTTTTCGTCAGAGAAM 

GGATGACTTTGTTGCTGCTGAMGTAMGAAGTGGTTCCTCMGATGGMTACCTCCAC 
20 GMGGMCMGAGGCTCCAGTTGGAGTGCTCAGAATACTCCTCGAGGAMTTACMTGAMGTC 

AACAGAGGCCCTCTTCCACCATTACGACCCCTTAGTTCTACAGGTTACCGCCCMGTCCTCGGGACCGTC 

GGGGGACTTGGACCTTCCTGGGCTAGTGCAMTAGCGGCAGTGGAGGCTCMGAGGAMGTCT 

GGTCGTCATGTACGCTCCTTTACACGATAAAMTCCTTTTGGGMCATCTTAACTC 

MTMGACATTGMGGACCAATTTAGACTTAGCAGTTATC^ 
25 TGTTTGATACCTAACAAGATTTCAATAAAAATCCAAACTTTGTATGT 

39 

MAARDSDSEEDLVSYGTGLEPLEEGERPKKPIPL 

DKSVLGPEDFMDEEDLSEFGIAPKAIVTTDDFASKTKDRIREKARQLAMTAPIPGATLLDDLITPA 
QGVGPRVKRRPRRQKPDPGVKIYGCALPPGSSEGSEGEDDDYLPDNVTFAPKDWPVDFTPKDWHGLAYKGLDPHQALFG 
30 EHFNLFSGGSERAGDLGEIGLNKGRKLGISGQAFOT^ 

KDLRYVGKILDGFSLASKPLSSKKIYPPPELPRDYRPVHYFRPMVM 
LLGETPIQGSATSVLEFLSQKDKERIKEMQATDLKMQ 

PFAKDPEKQKRYDEFLVHMKQGQKDALERCLDPSMTEWERGRERDEFARAALLYASSHSTLSSRFTHAKEEDDSDQVEVPRDQE 
NDVGDKQSAVKMKMFGKLTRDTFEWHPDKLLCKRFNVPDPYPDSTLVGLPRVKRDKYSVFNFLTL 
35 RGPDKSRKPSRWOTSKHEKKEDSISEFLSLARSKAEPPKQQS 

IFASSSDEKSSSSEDEQGDSEDDQAGSGEANFQSSQDTDLGETSSVAHALVPAPQEPPPSFPIQKMQIDEREEFGPRLPPVFCP 

NARQTLEVPQKEKHKKNKDKHKAKKEHRRKKEK^^ 

SLPLRRQ 

40 

40 GTGTTAGMTTTCTGTCCCAAAMGACAMGAGAGMTCAMGAMTGMGCAGGCMCTGAC 

GCCAGGAGTCTGGCCCAGMCGCTCAGAGCAGCAGAGCCCAGCTCTCCCCTGCAGCGGCTGCTGGGTACT^ 

GCATTAGGTGGTGGGACGGCCACCTTAAMGCCAGCMCTTCMGCCTTTCGCCAMGATC 

GAGTTCTTAGTACACATGAAACAGGGTCAGAMGATGCTCTGGMCGCTGTCTGG 

CGTGAGCGGGATGAGTTTGCCCGGGCGGCCCTGCTGTACGCATCTTCCCATTC^ 
45 • GAGGAGGATGACTCAGATCAGGTTGMGTCCCTCGAGACCMGAGMTGATGTCGGGGATMGCAGTCGGCTC 

ATGTTTGGGMGCTCACCCGAGACACGTTTGAGTGGCACCCTGACAAGCTO 

CCAGATTCMCTTTAGTTGGCTTACCMGAGTC 

TCCTTGCCCACCACTCMGCATCMGTGAAAMGTATCACAGCACCGAGGTCCCX3ACAAATCMGAAM 
ACCTCTAMCACGAAMGAMGMGATTCCATTAGTGMTTTTTMGTTTGG 
50 TCCAGCCCCTTAGTAMCAMGAGGAAGAGCATGCACCAGMTCATCCGCAMTCAGACAGTCMCAMGATC 
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GCTGMGGAGAAGGGAGCCGCCCATCCATGGACTTATTCAGGGCCATCTTTGCCAGT^ 
GAGGATGAGCMGGTGACAGTGMGATGATCAGGCAGGCTCTGGGGAGGCC 
GAMCATCATCTGTGGCTCACGCTCTTGTGCCAGCACCCCAGGAGCCjXGACCTTC 
GAMGAGMGAGTTCGGCCCGCGGCTGCCTCCCGTCTTCTGCCCCMTGCTCGTCAGACACTTC 
5 CATAAAAAGAACAAAGACAAGCACAAGGCCAAGAAAGAGCACAGGCGGAAGAAAGAGAAGAAAAAGAAACACAGGAAGCACAAA 
CACAMGGCAAGCAAAAGMTAAAAMCCAGAGAAMGTAGTAGCTCCGAGAGTTCCGACAGCAGCGACAGCCAGAGTGACGAG 
GAMCCGCAGACGTGTCGCCCCAGGAGC^ 

GCTCGTCCTAGMTCATTTCTCCTCCATGATGGAAGCCCAGTGATTGTTCAGTTMCGC 
MTTCCTGCTGTAAMTAATTTTTAAMCCTTGACATTTCAM 
1 0 TTTCAGTTAGTATCGGGGGAAAAAMTCCAGATTGMCAG 

41 

MAQVS INNDY S EWDLSTDAGERARLLQSPCVDTAPKSEWEAS PGGLDRGTTSTLGAI F I WNACLGAGLLNFPAAF STAGGVAA 
GIALQMGMLVFI I SGLVILAYCSQASNERTYQEWWAVCGKLTGVLCEVAIAVYTFGTCI AFLI I IGDQQDKI IAVMAKEPEGA 
SGPWYTDRKFTISLTAFLFILPLSIPREIGFQKYASFLSWGT^^ 
15 FGFQCHVSSVPVFNSMQQPEWTWGGVVTAAMVIALAVYMGTGICGFLTFGMVDPDVLLSYPSEDMAV 

PILHFCGRAWEGLWLRYQGVPVEEDVGRERRRRVLQTLVWFLLTIiLLALFI PDIGK VI SVIGGLAACFIFVFPGLCLIQAKLS 
EMEEVKPASWVA^VSYGVLLVTLGAFIFGQTTANAIFVDLLA 

42 

ATGTGATGCATGCTCACGTGTCTCCGCAGCCGGCTCGGGAMGMTCCCCCMGCTC 

20 TCAGTTTCCTCCAGCTCTGCTGMGCCAGCACAGMGTAGCCCAAACTC 

TGAGAMGMGAMTTGGGTCCAGAMGGGMGTGAGGAGMTCAGATCCCAGACCTTTGGGGAGAAGGAGCAACCGCCTCTGG 
CACAGCCCATCAGGGAGAMGAGCAGGTTGAGMGAGTCCTMGCTMCAGCCCCAMCAGGTGGGTG 
GGCATGTGGTTGTMGGCAGMCCCACAGACCTTGCAGGMGAAGGCTCTCGGGGCCATC 
TACAGCGAGTGGGACTTGAGCACGGATGCCGGGGAGCGGGCTCGGCTGCTGCAGAGTCCCTGTGTGGACACAGCCC^ 

25 GAGTGGGMGCCTCTCCTGGGGGTCTGGACAGAGGCACCACTTCCACACTTGGGGCCATCTTC 

GGTGCAGGGTTACTCMCTTCCCAGCAGCCTTCAGCACTGCGGGGGGCGTGGCAGCAGGCATCGCACTGCA 

GTTTTCATCATCAGTGGCCTTGTCATCCTGGCCTACTGCTCCCAGGCCAGCMTGAGAGGACCTACCAGG 

GTGTGTGGCMGCTGACAGGTGTGCTATGTGAGGTGGCCATCGCTGTCTACACCT 

ATTGGCGACCAGCAGGACMGATTATAGCTGTGATGGCGAMGAGCCGGAGGGGGCCAGCGGCCCTTGGTACACAGACCG^ 
30 TTCACCATCAGCCTCACTGCCTTCCTCTTCATCCTGCCCCTCTCCATCCCCAGGGAGA 

CTGAGCGTCGTGGGTACCTGGTACGTCACAGCCATCGTTATCATCMG 
• ATCCTGACCAGGCCGGCTTCCTGGATGGCTCTGTTCMTGCCATGCCCACCATCTC^ 

GTGCCCGTCTTCMCAGCATGCAGCAGCCTGAAGTGMGACCTGGGGTGGAGTGGTC 

GTCTACATGGGGACAGGCATCTGTGGCTTCCTGACCTTTGGAGCTGCTGTGGATCCTGACGTC 
35 GACATGGCCGTGGCCGTTGCCCGAGCCTTCATCATCCTGAGCGT 

GTGGTGGMGGCCTGTGGCTGCGCTACCAGGGGGTGCCAGTGGAGGAGGACGTGGGGCGGGAGCGGCGGCGGCGAG^ 

ACGCTGGTCTGGTTCCTGCTCACCCTGCTGCTGG^ 

GCCGCCTGCTTCATCTTCGTCTTCCCAGGGCTGTGCCTCATTCMGCCAAACTCTCTGAG 
TGGTGGGTGCTGGTCAGCTACGGAGTCCTCTT^ 
40 GTGGATCTCTTGGCATMCCACTGCCTCCCAGGGAACACMGGCCTTTGCCATTGGTCGCAG^ 
GGGCCATTCTTAGTCCACGATCATTCCMCTGGTGGGATGACATCCGGACATCCTCTO 
CACACCTCTGGACAGCTCAMTCCAGTCCCCTTCCTGCTCrc 

CATAGAGGACCCCTCCTCCTCTCCCAGTTCTCMCTTCTCCATGCCTGGMTCCACGGGTGMGA(3A 

'GAMGMTCCAGTCTGACTTCCCTCTGGAGMTGACTATC 
45 GGGTTGTGCTCATTACCCCAGGCCAGTGGTAGCTTCCTCAGGAGCCTGGCCACTTCCAAAGGTAGCACTGM 

CATAGTCAGGTAGATTCAGACCmTCCACACCTTCCTGGGCMCCCC^^^ 

AGACATTTCCCAGGCCTTGAGGGGCCCCArcCTGTCTCTTTCATCAMCCTGATGGTCC 

ATCCCCAGACATCACCAGGTCTMTGTTTACAMCGGTGCCAGCCCGGCTCTC 

TGAGTATTCCTCCGOTAGCTTTCCCCATMGGTTGGGAGTATCTGCTTTO 
50 AGGGTGGGTGATGGAGAAGGCTGAGMCCTTTCAGACCCTCTGTGTGG^ 
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ACCTTCTTTTTCAGTGATGTTTTCTC 
C 

43 

MAQVS INNDYSEWDLSTDAGERARLLQSPCVDTAPK S EWE ASPGGLDRGTTSTLGA I FI WNACLGAGLLNF PAAFSTAGGVAA 
5 GIALQMGMLVFI I SGLVI LAYCSQASNERTYQEVVWAVCGKLTGVLCEVAI AVYTFGTCI AFLI I IGDQQDKI I AVMAKEPEGA 
SGPWYTDRKFTISLTAFLFILPLSIPREIGFQKYASFLSWGTVWTAIVIIKYIWPDKEMTPGNILTRPASWMAVFNAMPTIC 
FGFQCHVSSVPVFNSMQQPEVKTWGGWTAAMVIAL^^ 

PILHFCGRAVVEGLWLRYQGVPVEEDVGRERRRRVLQTLVWFLLTLLLALFIPDIGKVISVIGGLAACFIFVFPGLC^ 
EMEEVKPASWWVLVSYGVLLVTLGAFIFGQTTANAIFVDLLA 

10 44 

GAGGCCMGAGCGCGGGCGGCGAGGCMGATGGCGGCAJVCCMGAGGAMCGGCGTGGAGGCTTTGCAGTTCAGGCGAAGAAGC 
CAAAAAGAAACGAAATAGATGCGGAGCCGCCAGCTAAGCGGCACGCCACAGCAGAGGAGGTGGAGGAAGAAGAGAGGGACCGGA 
TCCCAGGCCCCGTTTGCMGGGAMGTGGAAAMTMGGMCGGATTCTCATCTTTTC 
GACATTTAATGCAGGACTTGAGMTGTTGATGCCTCATTCTAMGCA 
15 TTMCGAGGTTTGTGAMTGMGMCTGTAATAMTGCATCTATTTTC 
AGMTGTATTMGATTTTGGTTAMCTCAT™ 

ttmgggttgagmtgmgcamcttttgattra 
tattatttatggtgaataggtggtmgcato 

amttcacctcacggaccatctgctamttccttgttcaamtagtmgttgactcmtamtt^ 
20 mgtmtcttttgamtagttgtgcaamgttacaacta 

tgmgatgactggaaactgtttgamggttctcggccccttttgtc 
tmttttcttatctggcatggttgtttttagtagatata 

ttatcmttctttcaggtacatttatmtgaggmtagccmgttatagamcamtgagcam 
ctactgttaaatmtatgcttaccttatttttataggcttttgatgmttaccacattatgct^ 
25 caggtamtatctttaamttagctatccaamtatacatgatatatcto 

atctttagtacaccacggtatcatcccaamgccmccatttgtggaccacgtgot 

tggtttcggmctttcagatcatagmgmgatgctgctc 

ttccagggaagttttggaggaccmctttatatgaam 

tagtcttcmtgtaccagmccctttggccaamtgattctgtgmgtmttgctgttttattacctg 
30 ttttgctgcatagamcttggaaattactccctttt 

GCCTTATATAAMTGTTTATTTTTATTTCAAAAGCATCTMTCTTTTTTTTTTTTTm 

CACAGCTGCAAMTACAGAGAGAMCAGCMGTGAAAGATGTGCAAAMCTGAGAMGAMGAGCCGMGACTCTTCTTCCACA 
TGATCCCACTGCAGATGTTTTTGTAACACCAGCTGAGGAG^^ 
TTTGAMGCMGAMGAMCGGATTTACAAAAGGCAMGAAAAATGAM 
35 TGGAAACCTGATTTGTTTTTCAGTTACTTTATATTTATTTTGTATTCAATC 
GTCTMTTAGTCTAGCATTTACMGAMGAAAMTTMGA 
AATTCATCAGTT 

45 

MDWKEVLRRRLATPNTC PNKKKSEQELKDEEMDLFTKYY SEVJKGGRKNTNEF YKTI PRFYYRLPAEDEVLLQKLREESRAVFLQ 
40 RKSRELLDNEELQNLWFLLDKRQTPPMIGEEMINYENFLKVGEKAGAKCKQFFTMVFAKLLHTDSYGRISIMQFF 

WLHQTRIGLSLYDVAGQGYLRESDLmiLELIPTLPQLDGLEKSFYSFWCTAVRKFFFFLDPLRTGKIKIQDILACSFLDDL 
LELRDEELSKESQETNWFSAPSALRVYGQYLNLDKDHNGMLSKEELSRHGTATMTNVFLDRVFQECL 

LENRKEPAALQYI FKLLDI ENKGYLIWSLNYFFRAIQELMKIHGQDPVSFQDVKDEI FDMVK PKDPLKI SLQDLINSNQGDTV 
TTILIDLNGFWTYENREALVANDSENSADLDDT 

45 46 

AAAMTCCGGCTTGGTGAGCTTGGGTCGCCTCTGMGGAGMCCATTTTCCATC 

GGTAGCGGTATTCTCCGCGGCAGTGACAGTAATTGTTTTTG 

GACGGCCITCCTAACCTTTACGGGGCCTGGCGGTGCTGACGCCTC^ 
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GMGCTGCTGCGCGGGAGTCGGCCATGGACTGGA^^ 

AAAAAMGTGMCMGAATTAAAAGATGMGAMTGGATTTATTTACAAMTATTACTC 

ACAAATGMTTCTATMGACCATTCCCCGGTTTTATTATAGGCTGCCTGCTGAAGATC 

C^TCAAGAGCTGTCTTTCTACAMGAAAMGCAGAGAACTGTTAGATMTGMGM 
5 AMCGCCAGACACCACCTATGATTGGAGAGGMGCGATGATCAATTACGAAMCTTTTTGMGGTTC 

MGTGCMGCMTTTTTCACAGCAAMGTCTTTGCTAMC 

TTTMTTATGTCATGAGAAMGTTTGGCTTCATCAMCAAGMTAGGAC^ 

CGGGMTCTGATTTAGAAMCTACATATTGGMCTTATC 

TTTTATGTTTGTACAGCAGTTAGGMGTTCTTCTTCTTTTTAGATC 
10 GCATGCAGCTTCCTAGATGATTTATTGGAGCTM 

CCTTCTGCCCTMGAGTTTATGGCCAGTACTTGMTCTTGATAAAGATCACMTGGCATGCT 

CATGGMCAGCTACCATGACCMTGTCTTCTTAGACCGTGTTTTCCAGG 

ACCTACTTGGACTTTGTCCTTGCATTAGAAMCAGAAAGGM 

AACAAAGGATACCTGMTGTCTTTTCACTTMTTATTTCTTTAGGGCCA 
15 GTTTCATTTCMGATGTCAAGGATGAMTCTTTGACATGGTAAAACC 

MCAGTMTCMGGAGACACAGTMCCACCATTCTMTCGATTTGMTGGCTTCTGGACTO 

GCAMTGACAGTGAAMCTCTGCAGACCTTGATGATACATGATCTCTGAMGACTAGACTGT^ 

TGCTGCATGTAMGCCTTTAAAGCAAMTCCTCAGAMTG 

AAAAAAAAAAAAAAAAA 

20 47 

MPKAKGKTRRQKFGYSVNRKRLNRNARRKAARGIEC SHIRHATOHAKS WQNLAEMGLAVDPNRAVPLRKRKVKAMEVDI EERP 
* KELVRKPYVLOTLEAEASLPEKKGmSRDLIDWRYMVENHGEDYKAMA^ 
SLQKRKMEVE 

48 

25 GTTACACGAGGTCTGAGAGACAGAGGCAGCGTGTTTGAGCTGC 

CCCGGAGGCAGMGTTTGGTTACAGTGTCAACCGAAAGCGTCTGAACCGGAATGCTCGACGGAAGGCAGCGCGCGGGATCGAAT. 
GCTCCCACATCCGACATGCCTGGGACCACGCTAMTCG 

GGGCGGTGCCCCTC(^TMGAGAMGGTGMGGCCATGGAGGTGGACATAGAGGAGAGGC 
ATGTGCTGMTGACCTGGAGGCAGAAGCCAGCCTTCCAGAAMGAMGGAMTACTCTGTCTCGGGACCTC 
30 GCTACATGGTAGAGMCCACGGGGAGGACTATMGGCCATGGCCCGTGATGAGMGMTTACTATCMGATACCCCAAMCAG^ 
TTCGGAGTMGATCMCGTCTATAAACGCTTTTACCCAGCAGAGTGGC 
AG&TGGAGTGACTGGTTTAGATCACAGCTGCCCCAGGCTC 
AGGAGGTGCTGTGTGGCTCCAGAGGAGCTGGCCAGGTCCCATGGM 

GGGGMGGMCTGTTCTCAGAGGCTCCMTTTATATTCATCTGGGGGTTCACGGAAMGCCAGMCCTGC 
35 GTGATGTAMTATAGTGTGTACATMTAMGCAMTATATmAAAAAAAAAAAAAAAAAA 

49 

MMPEFQKSSVRIKNPTRVEEI ICGLIKGGAAKLQI ITDFDMTLSRFSYKGKRCPTCHNI IDNCKLVTDECRKKLLQLKEKYYAI 
ETOPVLT\TCEKYPYMVEWYTKSHGLLV^^ 

VYHPNVKWSNFMDFDETGVLKGFKGELIHVFNKHDGALRNTEYFNQLKDNSNIILLGD 
40 RVDELLEKYMDSYDIVLVQDESLEVANSILQKIL 

50 

ATGATGCCAGMTTCCAC^VAMGTTCAGTTC^ 

GGAGCTGCCAMCTTCAGATAATMCGGACTTTGATATGACACTCAGTAGATTTTC 
CATMTATCATTGACAACTGTMGCTGGTTACAGATGM 
45 GAAGTTGATCCTGTTCTTACTGTAGMGAGAAGTACOT 
CMGCTTTACCAAAAGCTAAAOTAAAGAMTTGT^ 
MGCTCC^ACMCATAGCATCCCCGTGTTCATATTTTCGGCTGG 
GTTTATCATCCCMTGTCAAAGTTGTGTCCMTTTTATGGAT^ 
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ATTCATGTATTTAACAMCATGATGGTC 

CTGGGAGACTCCCAAGGAGACTTMGMTGGCAGATGGAGTGGCCMTGTTGAGCAC 

AGAGTGGATGAGCTTTTAGAAAAGTACATGGACTCTTATGATATTGT^ 

ATTTTACAGAAGATTCTATAA 

5 51- 

MHTLVFLSTRQVLQCQPMCQALPLLPREL^ 

MQAVILGLTARLHTSEPGASTQPLCRKHALRVLDMTG^ 

VDLRVNRASYAFLREALRSSVGSPLRLCCRDLRAEDLPMRNTVAL^ 

LRLHYVHGDSRQPSVIXSEDNFRYFIAQMGRFTCLRELSMGSSLLSGRLDQLLSTLQSPLESLELAFCALLPEDLRFL^ 
10 HLKKLDLSGNDLSGSQLAPFQGLLQASAATLLHLELTECQLADTQLLATLPILTQCASLRYLGLYGNPLSMAGLKELLRDSVAQ 

aelrtwhpfpvix:yeglpwpppasvlleasineekfarve^ 

• 52 

cggggctggaggcggtggctgcggttgcgggaccggcactatgctgggccttcctaccacttgtc 
agggictctcctccctgctgmgtccctctcctgcagg^^^ 

15 gcacacgckaggtgctgcagtgccagcc^ 

aggtggccttcatggacaagmgacagtggtactgcgcgagttggtacacacgt^ 
tgctacaggagtgtgcccactgcagccgtgccctcctgcaggagcggcctagcactgagagcatgcaggctc 
tgactgcccggctccacacctcagagcctggggccagcacacagcccctctgcaggaagcatgcgctgcggg^ 
cgggcctcttggatgatggtgtggmcaggatcctggcaccatgagcatgtgggactg^ 

20 ttgcccagcagcagggtggggccgcagagcctgggccagcccccatccccgtgga(&tc^ 
cctcctatgcgttcctgcgggaggcactccgmgcagcgtgggcagcccgctgcgg 
acctgcccatgcgcmcactgtggccctgctgcagcttctggatgcaggctgcctgcgccgcgtggacc 
tgggcctgcgcggcctgtctgtgatcatcccacacgtggcccgcttccagcacctggccagcctgcggctccactatc 
gggattcaaggcagccctccgtggatggcgaggacmcttccgctacttccttgcccagatgggcc^ 

25 agctcagcatgggctcctctctcctttcagggaggctggaccagctgctcag 
tggccttctgtgctctgctgcctgaggacc^^ 

gtggtaacgacctgtctggcagccagctggcacccttccagggtctgttgcaggcatcagcagc 
tgactgagtgtcagctcgcagacacccagctgttggccacactacccatcctgactcagtgcgccagt^ 
tctatggcmcccactgtccatggcgggcctcmggagctgctgcgggactc^ 
30 accccttccctgtggactgctatgagggcttgccctggccgccgcctgcctctgtcctgctggaggcctc 
agtttgcccgcgtagmgctgagttgcaccagctgcttctagcctcaggccgtgcccatgtg 
ggcgactggctgcggactacttcagcctatgatgmgtagctctgggtgagacacaggccgccctgcagtc 
agggcctttgctgggacccctggtggaggccttcacaamgcactggttactggtttcctc ■ 
acacctcaagcctcccctgctttctgcagtgccccacgcggttttccctgcacttgctcca^ 

35 CCCGGGGCTGGATGTCAGGCCTCCATTGCCCTGCTC 

ATGCTTTGGGATTCCAGTTGTGAGCTGAGAGAGATGATGGCCTCTC 
CTGGGGTTGAAGTTGGACAGAGGCCTGCTTCAGGGMGCTGGGAGTCCCCAGGC^ 

CCACCCAGCCACCCCACTGGGCTGGGCTCGGGCTGGAGGGGGTCATCMGGTACACATGTGCCTGGMGTTGM 
TTTTTTTTTCTGTCCACGTTGGTCACCCTTATCCTTATCTC 

40 AGCCTGTCGCCCCGCCCTTCGCGGGGCAGCCCCGTCGGCACTGCCGGCCAGTCCTTGCTCTTC 
TCCTACTGTGGCCGGCCAGGGCCAGCGAGGGACCCCCCCCATGCAGAGCTGGAGGTTGGGGT^ 
MGGGAGGTGTTGGGGCCTCCCCGGCCACCTTCCATTGCTACCCCAGGATTCCCGA 
CACGGCTCAGCGCACACTGCGCGGCTTCCACCTTT^ 
. !TCATCCGCGCGCCCGGGGCCCTAGCAGTGGATCTCGTAGGCGACCGGCGGGGGCACGCGGAGTC 

45 GGGCCGCAGTCAGCGGGCGCCTCCGCCGGACCCTCGGCGAAGAGCGGCTTGGAGCGGTTGATGACGMCATCTCG 
TCGTCGCGGAGCTCCTCTAGCTGTGCGMCGTACAGGGGCCGTCCMGTAGTCGTTGA 
CCCCGCGCCTTTTTTCGCCTGCGGCGCCGGCGACAGATCATGGCGACCAGGAGCAGCGCCGTC^ 
GCCGCMTGGCCGTCTGTGTGGCCACGCCCAGGGCGCGGAAGGCCATGCTGCCCGCCTCGGGCCG 
CGGGCGGCCGGAGGCGGCGGTTGCGCGGGCTGCTGCGGCTGCTGCCGGGACGCGTTGACCAGGAGCC 

50 GCGCCX3CCGGCGTTGGAGGCCTCGCACTCGTACTTACCGGCGTGCGCCAGCGTGATGTTGCTGAGGM 
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gtgtcggatgccgagtgtcccx^ccaggcc^ 

ggcacctttctccaggtcaccmtggctgcgggtagccggaggcttggcaggcaacccgcaggtc 
* agctccagcggctgcacgtggacagagggcggmtgcagatgaggctgctgtggg^ 
cgcgggggctctgcacacatgatcttcctgtccctggaggtgagcagccgctggccgccctccttg 
5 tgcagggcgcagtcacagcgccatgggttctctgtgggagagcagcgttaggcagg 
gtgcagttggggttttgcaggccaggacagaggcctctttcccacc 
ccccttccttgaccccmgctcctogggcggcaggcccttcacot 
tacagatcacttctccgtcggtctctgagamgcacctgctccttaagtcttc 
gggcgtccacatagacatctcaccagcactgamcctcacmgtcctctcagctttgcctttggatgcc^ 
10 ccccagtcctggtcag£tgtctctctcctttgcmtct 
ttmgtcaccctgtggagttcctgtcttctgttttc 

gccctccagtgtggctgcaggcggtggtgcagccttccagactgctgcccagttgcctgatgtc 
cctgctccctactgccmcaccgtggcccagacagagacgctttccgaggaagaggtacctgtgaggcgcaggacttgcagact 
ggccaggggctgcagggcctctcggctgatggtgcccag 
15 tagagcctggtcctccagcagctcaatg^ 

aggaaamgggckjccgaggttgggggcatgtctctcttcttac^ 
cagggcgaggggccatmtggagtatccc^ 

ccgcgccagctggttgcccgccaggtagagcacgcgcagctgggccaggcctacgaaggcgccgctgcgcaagccgcgcagccg 
gttgctagtgagcgccagctccagcaggcgcggctgcgcgcggmggcgccggcctccagggcgcgcaggctgttc 
20 gtagagccggcgcagagcggcgagtggcgccagggctcccggctctaggcgggcgatgttgttgtcctgcaggaacagtgtctg 

CAGGCCGGGGAMGAGGAGGCGCCTACCCCGCTGCGGGGGTCTTCCTCTCCCTGGGGGCACC^ 

GGTGCCCACCTGCGTCCCTGGCGGGATTCCCAGCGGGACGACGCGCMCCGCAGGGCGCCACACTCCACCGTGGCGCTGTAGCA 
GCGGCAGGCTGCTGGGCAGCCGGCGGCGCGGAGCGGCAGTAGTAGCAGCAGCAGCGGCAGCAGTTCGGGGGCCCTCAGGG 
CTCCCGAGGCCCGGTTCCTCACCGGCCCTTCCGCGGTTCAGCCGCAGACGCGTGCCCTCCTGAAACACAGGTTGGCAGGCCAGT 
25 CTCGGCAGTCGAGAGCCAGCCMTAGATGGMTGGAGGCCTGCACCTGCGTCTAACTTTTC 
CTAATAAAACGTTCTGGTTTTCTCCTTTGAC 

53 

MLQTSNYSLVLSLQFLLLSYDLFVNSFSELLQKTPVIQ 
TAVYFALS ISLHVWMLRWKNSNSFIWTDGLQML 

30 54 

ATGCTCCAGACCAGTMCTACAGCCTGGTGCTCTCTCTGCAGTTCCTGCTGCTGTCCTATGACCTCTTTC 

GAACTCCTCCAAAAGACTCCTGTCATCCAGCTTGTGCTCTTCA 

TTCCTCATGTTCTTCAACACCTTCGTCTTCCAGGCT^ 

ACAGCTGTGTACTTTGCCCTCAGCATCTCCCTTCATGTCTGGGTCATGMCTTACGCTGGAAA 
35 ACAGATGGACTTCAMTGCTGTTTGTATTCCAGAGACT 
CTACXSCGATCCTCACTTCTACCAGGACTCTTTGTC^ 

55 

mggfqrgkygtmegrsednlsatppalriilvgktgcgksatc^ 
sifesqadtqelyknigdcyllsapgphvlllviqlgrftaqdtvairkvkevfgtgamrhw 
40 dncslkdlwecerrycafnwgsveeqrqqqaellavierlgreregsfhsndlfldaqllqrtgagacqedyrqyqmvewq 
vekhkqelrenesnwaykallrvkhlmllhyeifvflllcsilffiiflfifhyi 

56 

GTTCCCAATCAGmCCAGCCMCACCAGGGTGTCCTAGTCCGCAGAGGTGTGGGG 
TTTGTGCAGCTGAGTCATGGAGCTTTCAGCCCCAGCA^ 
45 MCAGCATCCCCGCACACAGACGCAGAGCAGGACTCTCTCTGCTGCCACTTC^ 
TCAGTGACTGCCACCCTGGAGGACAGGGCACMCMCCGTTTC 
ATGGCTGMGGTAGATCAGAAGATMCTTGTCTGCMCACCACCGGCA 
AAAAGTGCCACAC&GMCAGCATCCTTGGCCAGCCCGTC 
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GTGAAMCAGGMCATGGMCGGGAGGAMGTCCTGGTGGTTGACACGCCCTCCATCTTTC 
CTGTACMGMCATCGGGGACTGCTACCTGCTCTCTGCCCCGGGGCCCCACGTCCTGCT^ 
ACTGCTCAGGACACAGTGGCCATCAGGMGGTGAMGAGGTCTTTGGGACAGGGGCCATGAGACATGTGGTC 
CACAMGAGGACTTAGGGGGCCAGGCCCTGGATGACTATGTAGC^ 
5 TGTGAGAGMGGTACTGTGCCTTCAACAACTGGGGCTCTGTGGAGGAGCAGAGGCAGCAGCAGGCAG 
GAGAGTCTGGGGAGGGAGCGAGAGGGCTCCTOCACAGCMTGACCTCTTC 

GGGGCCTGCCAGGAAGACTACAGGCAGTACCAGGCCAAAGTGGAATGGCAGGTGGAGAAGCACAAGCAAGAGCTGAGGGAGAAC 

GAGAGTMCTGGGCATACMGGCGCTCCTCAGAGTCAMCACTTGATGCTTTTGCATTATC 

TGCAGCATACTTTTTTTCATTATTTTTC 
10 CCCCATGGAGTCATTGTTCTMTMTCACCM 

GCTCCCATTTCCCCTTCTTCCTGATAGACTTGGAGCTGTGTGCCTCCACTCCMGGCTGCCT^ 

CTCTGTCTGCCMCMCTGCTTCAGGMTGGGCCmGATCCCATGCAGGTCCCTGAG 

GGATGGAAGATCTCTCCTTAGATAGMCCTGTCTTCC^ 

TTTGTGCTGCCAACCTGAGATTGMGGCAGTGCCTCAGAGC^ 
15 TGCAGCCTGCCCTATCTCCGCATTTCCAGTTGTATTAGCCMT 

TCTACCTGCATTCTAAMCATTCAAAGTAATAAAAATTTCTCCACATT 

ELGIPALLDPNDMVSMSVPDCLSIMTWSQYYNHFCSPGQAGVSPPRKG^PCSPPSVAPTPVESEDVAQGEELSSGSLSEQGT 
GQTPSSTCMCQQHVHLVQRYLADGRLYHRHCFRCRRCSSTLLPGAYENGPEEGTFVCAEHCARLGPGTRSGTRPGPFS 

20 HQQQLAEDAKDVPGGGPSSSAPAGAEADGPKASPEARPQIPTKPRVPGKLQELASPPAGRPTPAPRKASESTTPAPPTPRPRSS 
LQQENLVEQAGSSSLVNGRLHELPVPKPRGTPKPSEGTPAPRKDPPWITLVQAEPKKKPAPLPPSSSPGPPSQDSRQVENGGTE 
EVAQPSPTASLESKPYNPFEEEEEDKEEEAPMPSLATSPALGHPESTPKSLHPWYGITPTSSPKTKKRPAPRAPSASPLALHA 
SRLSHSEPPSATPSPALSVESLSSESASQTAGAELLEPPAVPKSSSEPAVHAPGTPGNPVSLSTNSSLASSGELVEPRVEQMPQ 
ASPGLAPRTRGSSGPQPAKPCSGATPTPLLIiVGDRSPVPSPGSSSPQLQVKSSCKENPFNRKPSPMSPATKKATKGSKPVRPP 

25 APGHGFPLIKRKVQADQYIPEEDIHGEMDTIERRLDALEHRG^ 

VFKQQNLEQRQADVEYELRCLLNKPEKDWTEEDRAREKVLMQELVTLIEQRNAIINCLDEDRQREEEEDKMLEAMIKKKEFQRE 
AEPEGKKKGKFKTMKMLKLLGNKRDAKSKSPRDKS 

58 * ' 

GGAGCTGGGGATCCCCGCTCTCCTGGACCCCMTGACATGGTCTCCAT^^ 
30 GTCCCAGTATTACMCCACTTCTGCAGTCCTGGCC^ 

GTCTGTAGCACCCACTCCAGTGGAATCAGAAGATGTGGCTCAGGGCGAGGAGCTCTCCTCAGGCAG 

CGGCCAGACCCCCAGCAGCACGTGCGCAGCCTGCCAGCAGCATGTGCACTTGGTGCAGCGCTACCTGGCTGACGGCAGGCTC 

CCATCGCCACTGCTTCCGGTGTCGGCGGTGCTCCAGCACCCTGCTCCC 

TGTGTGTGCAGMCACTGTGCCAGGCTGGGCCCGGGGACACGGTCGGGGACCAGGCCTGGGCCCTTC 
35 GCACCAGCAGCAACTCGCAGMGATGCCMGGATGTTCCAGGAGGCGGCCCCAGCTCCAGTGC 
TGGACCCAAGGCCAGCCCTGAGMXCCGGCCGCAGATCCCTACC^ 

CCCTGCGGGCCGCCCCACCCCTGCCCCCAGGAAGGCCTCTGAGAGCACCACCCCAGCACCCCCCACGCCCCGGCCCCGCTCCAG 
TCTGCAGCAGGAGMCCTGGTGGAGCAGGCTGGCAGCAGCAGCCTGGTGMCGGGAGACTGCACGMCTO 
GAGGGGGACACCGAAGCCGTCCGAGGGGACACCAGCCCCCAGGMGGACCCCCCAT^^ 
. 40 GMGMGCCAGCCCCACTTCCCCCAAGCAGCAGCCCGGGGCCACCAAGCCAGGACAGCAGGCAGGTGGAGAATGGAGGCACCGA ' 
GGAGGTGGCCCAGCCGAGCCCMCGGCCAGCCTGGAGTCCAMCCCTAT^^ 
AGAGGCTCCAGCTGCACCCAGCCTGGCCACCAGCCCTGCCC 
CGGCATCACCCCTACCAGCAGCCCCMGACAAAGMGCGCCCTGCC^^ 
CTCCCGCCTCTCGCACTCGGAGCCGCCCTC^ 
45 CCAGACTGCAGGTCCAGAGCTTCTGGAGCCGCCAGCTGTGCCCMGAGCTCCTCAGAGCCTGCTGTC 
TGGAMCCCTGTCAGCCTCTCTACCMCTCCTCCCTC^CCrcCTCTGGGGM 

AGCCAGCCCTGGCCTTGCCCCCAGGACCAGGGGCAGCTCAGGTCCCCAGCCAGCCAAGCCCTGCAGTGGTC 

TCTCTTGTTGGTTGGAGACAGGAGCCCGGTGOT 
GMTCCTTTTMCCGGMGCCATCA 

50 TGCCCCTGGACACGGCTTTCCACTCATCAMCGCAAGGTCCAGC^ 
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ggataccattgac<:gccggctggatgccctggagcaccgtggggtgctgctc^ 
ccgtgaggatcacatgctggtggactggttcmgctcatccacgagaa 
tgtcttcmgcagcagmcctggagcagcgccaggctgatgtcgagtatgagctccggtgccto 
ctggacggaggaggaccgggcccgggagmggtgctgatgcaggagcttctgaccctcattgagc 
5 ctgcctggatgaggaccggcagagggaggmgaggmgacaagatgttggmgccatgatcmgmgam 
ggctgmcctgagggcmgmgmggggaagttcaagaccatgmgatgttgaaactg^ 
cmgtcccccagagacmgagctmcagcacgagaagccagttggggactgccccctcc^gagcagctcctgggctc 

GTTTGMGGGGGCGCCCTGCTCCCCTCAGATCAGTCAC^AGGMGATGACTMGGGGAGGGATCCTC 

CTCCTCAGGGACCTCTGACTGCTCTGGGCCAAAGMTCTCTTGTTTCT 
10 CMCCTGATTCTGATGACTGCGGATGCTGTGACGGACCCMGGGGCAMTAG 

GCACTTCCGCCCCTCACCCTGCCCAGCCCCTGCCATGAGCTCTGGGCTGGGTCTC 

GCCAGCMGTGGCGCTGGGCCACACTGGCTTCTTCCTGCCCCATCCCTGGCTC 

CCCTTGGATCTCAGTTTCCCTCACTCAGGMCTCTGTTTCTGMGTCT^ 
- CTGTCAGACGTGAATGGGCCTGACGGGCAAATCCATCCCTCTCTCCCTCACAGTTCCAGGAGCGGCTTCCCT 
15 CTCCACAGGGAGCCTCCCTTGCCAGGACCAGGGCTGCGACGGCCATGCTGGGGCAGGTGAGTGCTCTGTTAG^ 

CTGTCCCCAGGCTGCAGTTCTGGTCCCTGGTTGTCAGGTAGGMGGGTGCACTTGAAGCAGGTGCTCATC 

TTTATAGTCTGACCCCTCACTTAGGCTTTCCTCTGCCACCCCGGTCCAGGGMGAGGCTCGCTCCC 

TCTGTCTGCTCTGTTGTCTGTTCTTTCCC 

gagaggaggagtgatggggaccgccaccttttctgcaggamtgtgcccagcagctcttg^ 
20 atctctgggatgcctctaggcccccttccctctacacacctctgggaamga™ 

ACCMTAMCAGACMCCTCMCTGCCAGTGCCCTGCAGCCTCGGGCCACAGCGGCAGCCTTGT^ 
GCCACACCTGGTGGCTGMCATCTCTGGTCGCCCAGAGGCCATGTTGGGGCCATCCTCCMGAGGGATCTCTGCCCTCACC 
TGCCACTGGGCAGGATCCCTTTCCTCTGCAGGGAGAGGTGGCTCCTCGGCCATGCAGCCCCTGGCAGGCTCCTTCTA 
CTGTTGACCTGGAGCTGGCGCCACCMCTCCAGGGCCTTTCCAGGGCCAGACAGGTAACACGCATGM 
25 GACGGGCTGTTTCGGTGTCAGGAGACAMGCTGGCAGGGGCAGGGGTGAACTGGAGGCMGTCMGTCACCTGTGGCC 
GCTGMTGTGGGCCCGGTGTTGCCAGATCCTTTGTCA 

GCTCACTGCAGGCTCCGCCTCCTGGGTTCACGCCATCCTCCTGCCTCGG 
GACGCCTGGCTAJ^TTTTTTGTATTTTTAGTAGAGACGGGGTTTCACCGTGTTAACCAGGAT^ 
30 ATCCACCCACCTTGGCCTCCCAMGTGCTGGGATTACAGGCGTGAGCCACCACGCCCGGCCACTAGCCTGMTTTCMTCAAGG 
GTTGGCTGATACTGTGTGTCCAGGGTGGACTGGATTTGTO 

CCTTCGAGGTCGAGGACMCTGTTCCCATTAGATTGCACCCTCTGCCCTCAGGTTCTTGAGGGTGTC 

CATGGGATGTCCCTGAGCCGGCCCTTGATTGGGGCCTCACCATTT^^ 

GTGACCTGATGGGATTATGGGACTCCCTCCAGGTGCCCGAGACMGGTTGATATTTC 
35 MGCAMTGACAGAATACCGGAGMGGCAGGGATCGTGGGTGTCAGGAGCCAGAGGGGAGGGGGACAGATGTGCTGTGT^ 

ACAAGGTGTCAGGTGACTCCTTCCCAGCAGGGCCTCGCAGATGCACMGCACGGAGCTGGT^ 

GCGGCACATGCAGGGATTGGMCTCCCAGGCCAGGGCTCTAGGTCGCTCCCACCTTTTC 

TAGTGGAMGACATAMGCTAAAGCCMCTTTTMTCCTC^ 

TTGAGACTTAGTTTTCACAGAGGGATMTGMCCGTTGCAGAGGTTTATTGAGATCAT^ 
40 ACTGCACTCCAGCCTGGGCGACAGAGCMGACTCAGTCTCAAAAAAAAAAAAMTTAGCTGGGCAGGG 

GTMTCCTAGCTACTTGGGAGGCTGAGGCATGAGAATTGCCTGMCCCAGGAGGTGGAGGTTGCA^ 

CTGTACTCCAGCCTGGGTGACAGCGCGAGACTCCGTCTCAAAAAMGCTGGGTGTGGGGMCACCTGTGGTC 

GAGACTGAGGCAGGAGGATTGCTTGAGCTCAGGAGTTCTGGCTGCAGTGAGCTATGATC^ 

ACAGAATGAGAGCGACACTGTCTCAAAAAAAAAAAAAAAAAMGGCCGGGAGCGGTCGTm 
45 GGAGGCCAGGGTGGGCGGATC^CTTGAGGCCAGGAGTTCMGACCAGCCTGGGCMCATGGTGMTC 

MTTAACTGGACATGGTGGTGGACACTTGTMTCCCAGCTACTCAGGAGGCTGACACATGAGMTTC 

GAGGTTACAGTGAGCCGAGATAGCACCACTGCACTCCMCCTGGGCACAGAGTMGGCTCTGTCTTT 

59 

MASEGPREPESEGIKLSADWPFVPRFAGLNVAWLESSEAC^FPSSMTYYPFVQEPPWEQKIYTEDMAFGASTFPPQYLSSE 
50 ITLHPYAYSPYTLDSTQimSVPGSQYLYNQPSCYRGFQTVKHRNENTCPLPQEMKALFKK^ 
SSEIKSARGSHHLSIYAENSLKSIXSYHKRTDRKSRIIAKN^^ 
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SLLREVWPMVLSKGEIWKNNPNESOT^ 

VSIPSSEALSSDPSYNKEKHIIHPTQKSKASQGSDLEQNEAS 

RIETPKFQSKQQPQDNFKNNVKKSQLPVQLDLGGMLTAL^ 

HNPLDSSAPLMKKGKQREIPKAKKPTSLKKIILKERQERKQRI^ENAVSPAFTSDDTQIX5ESGGDDQFPE 
5 STPSVEDKSEEPPGTELQRDTEASHLAPNHTTPPKIHSRRFRDYCSQMLSKE^ 
LVLGLREVLKHLKLKKLKOTIISPNCEK^ 

DQFHKMVELTVAARQAYKTMLENVQQELVGEPRPQAPPSLLTQGPSCPAEDGPPALKEKEEPHYIEIVJKKHL^ 
SLEASTSQMMNLNL 

60 

10 .CCMGCCGACGGCCCGCTGCTGGCCTCCGTGACGCGGCCTCCTCCGCGCCTCGCGGCATGGCGTCGGAGGGGCCGCGGGAGCCC 

GAMGCGAGGGCATCMGTTATCAGCAGATGTCAAACCATTTGTCCCCAGATTO 

TCAGMGCATGTGTCTTCCCCAGCTCTGCAGCCACATACTATCCGTmTTCAGGMCCACCAGTGAC 

ACTGMGACATGGCCTTTGGAGCTTCMCTTTTCCACCTCA 

CCTTATACCCTTGACTCCACACAGMTC^ 
15 TTTCAMCAGTGMGGATCGAMTGAGMCACATGCCCTCTC 

GAGAAAAAMCGTATGATCAGCAAMGTTTGACAGTGAMGGGCTGATGGMCTATA 

TCACATCATTTGTCCATTTACGCTGAGMTAGTTTGAMTC 

GCAAAAMTGTATCTACCTCCAAACCTGAGTTTGMTTTACCACACTGGACTTTCCTGM 
TCAGAGATACAGMGCMCCCMGTGGGGACCTGTCCACTCTGOT 
20 GCTGCAGTGTTATCAMGGGTGAMTAGTGGTGAAAMTAACCCAMTGAATCTGTMCTGCT 
TCATGTACMGAGAGTTATCTTGGACACCMTGGGTTATGTTGTO 

mtgttacttctatgatamcttamgaccattgcttcatcagcagatc 

tcttcggatccttcctacmcamgaaaaacacattattcatcctaccc 

caaaatgaagcctcmgaaagm^ 
25 aggattgaagatgccgaggmtttcccmcctwcagto 

mgcagcagccacaggatmttttaaamtaatgtaaagaagagccagcttccagtgcagt 

gccctggagmgmgcagcactctcagcatgcamgcagtcctccamccagtggtagtctcagttggagcagtgccagtcctt 

tccaaagaatgtgcatcaggggagagaggccgccgcatgagtcaaatgaagaccccgcacaatcccttggactccagcgcccca 

ctgatgmgamgggmgcagagggagatccccmggccmgaagccmcctcactgm^ 
30 gagagmagcagcgtctccmgaamtgctgtgagtccagcttttaccagtgatgacac 

gaccagtttcccgagcaggcagagctgtcagggccagaggggatggacgmct 

gmgagccaccaggcacagagctccagagggacacagaggcctcccaccttgctcccmtcacaccaccttccctaagatcca 
agccgcagattcagggattactgcagccagatgcttagtaaagmgtggatgcttgtc 
cgtttccmgaccgtatgtaccagamgatccagtcmggccmgacta^ 
35 amcacctgmgctcaaaamctgamtgtgtcattato 
actttgcacacmttattgattatgot 
ttgmtmggcagttcctgtcagtgtggtggggatc 

acagtggcggcccgacaggcgtacmgaccatgctggagaatgtgcagcaggagctggtgggagagcccaggcctc^ 
cccagcctactcacacagggccccagctgccctgcagmgatc&cc^^^ 

40 GAAATCTGGAAAAAACATCTGGMGCATACAGTGGATGTACCCTGG^ • 
ATGMTTTGMTTTATGAGAGTTCTTGCCTGTGTGTC 
TTCTTCTGTTTTTCATGACMTGTAATTTGTC 

CTGCAGGCGTTCAGTGCTGCGGAGCCTGTTAMGGTCACTCAGATGTGCAGGTGTTM 
CTCTGGCTTTCTGAGCACACTACGGATCTGGAAAATAC 
45 TCTCCATTGCAGMTAGCTGAGCCMGTGAGTGAGTTTGCAGAMGCAGGTGGTC 

gggccattcctgcccggcmcagcaccgtcctgcagggagccacttggcagmgggtgcagggc 
ggctacagggamgggccctttctcaggggatgtagctttt^ 
tcagmggaamttggttttctmcctgtgactttttgaaatgm™ 
tamaagtacctagatttgtmatmtcaacaaatctttc 
50 ccaggaggacagagtttgctttcatattttccctgtaagtm 
aaaaaaaaaaaaa 
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62 

CTCirTGCTTTTGCATGGCTTGGCT 

GGTTTCTCTCAGTGGCACATTTTGCTTTTTCTGAGCCCCAMTACATTC 

GGTCATGGACTGAATTATGTGACTTTAMGGATGT^ 
5 CGTATCCAAAAACCCCGGGGCTATTGGMCCCTTCCMCACTTTTTCCTTO 

TGTTGGCTGTGCTGGTGTATTGCCAGACAGCTGTCTTTTGGTTCCCA 

TATCAAAGAAACCACCACCCTTCTTCCTMCAGCATTTTATGrc 

TGCCCCTCCMTTAAmCATGTGTCCMGMTMTCCM 

TCTTMGGCCAGTAATTTATCTGAAMGGTATTTT^ 
1 0 TTCATAGGGTAAMTCCAAGAAMGAGMGGATGTGTGGGGTTTC 

TATGATCCTTCTCTGCTAGMCAGGTTMTTCTCCAMTTTGTTTTGT^ 

AGCAATGGTCGGATGTAMTMCATTTAMGTATAGTGCACATM 

TTAGAGTTTTTTTAATTMCACTTGTGTTGCTAAATTCTATT^ 

GACAACCATTTAAAATMTGGATGGGTTACTATGAGCMTTTCG^ 
15 TGCGCATTMTGAGGTTGAAGAGACTATGAGAMTATGTATAGTG^ ' 

AAMCAMCTTGAGATTTTTMCGTMCTATTMCACAG 

ATMTCCTTTTTGTMCCCAGGTTGGTTTCTAC 

CCCTTTGTTTTCCTCAMCAGCATGATTTTTTTGCACATGTAGAMTTTTO 
GATTTTCTTCACTTCCCTCTTCCTTTCTACTMCTCCTO 
20 AGGGTTGGTTTTTATTTTTATTTTTGCTATTTACCTAAAAAAAAAAAAAAAAA 

63 ' 

MEVSRRKAPPRPPRPAAPLPLIAYLLALMPGRGADEPVWRSEQAIGAI^ 

GNCTEPVSLAPPARPRPGSSFSKLLLPYREGAAGLGGLLLTGWTFDRGACEVRPLGNLSRNSLRNGTEWSCHPQGSTAGWTO 
AGRNNRWYLAVAATYVLPEPETASRCNPMSDHDTAIALKDTEGRSLATQELGRLKLCEGAG 
25 YTSGMTGWPSMARIAQSTEVLFQGQASLDCGHGHPDGRRLL^ 

QARAKRVSWDFKTAESHCKEGDQPERVQPIASSTLIHSDLTSVYGTVVMNRmFLGTGDGQLLKVI^ 

EETPVFYKLVPDPVKNIYIYLTAGKEVRRIRVMCNKH^ 

KCPKIQIIRSSKEKTTV'IWGSFSPRHSKCMVKW^ 

FTNCSSLKECPACVETGCAWCKSMRCIHPFTACDPSDYERNQEQCPVAVEKTSGGGRPKEMGNRTNQALQVFYIKSIEPQKV 
30 STLGKSNVIVTGANFTRASNITMILKGTSTC 

SL I FPATWI SGGQNITMMGRNFDVIDNLI I SHELKGNI NVSE YCVATYCGFLAPSLKSSKVRTNVTVKLRVQDTYLDCGTLQY 

REDPRFTGYRVESEVDTELmiQKENDNFNISKKDIEITL^^ 

VRmGNLELYVEQESVPSTWYFLIVLPVLLVIVtt 

sfgtvpfldykhfalrtffpesggfthiftedmhnrdandknesltaldalicm 
35 ialqtklvyltsilevltrdlmeqcs 

tckalytlnedwli.wqvpefswalnwfekipenesadvcrnisvnvlix:dtigqak^ 
qmgtrqkelldidsssviledgitklntighyei sngstikvfkkianftsdveysddhchl i lpdseafqdvqgkrhrgkhkf 
kvkeotltkllstkvaihsvleklfrsiwslpnsrapfm 

VFDIKKTPHIDGCLSVIAQAFMDAFSLTEQQLGKEAPTNKLLYAKDIPTYKEEVKSYYTCA 
40 NEFNEEVALTEI YKY IVKYFDE I LNKLERERGLEEAQKQLLHVKVLFDEKKKCKWM 

64 

GCGAGGAGGAAACGGTGCCGGAGCGCGCAGGGCTTGCTGCCGCCACCGCCGCTGCACAGGCTGGCGGAGCGAGCCTGCCGCGCG 

CCGCCCTCCCCGCTCTCCTTCCTGGGCGAGCTGCGGGGATGGGG^ 

CGCCGOCCGCGTCTCCGTTGCCGCGCGCOTOAGCCGOCfiTCGC^ 
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GGCCGGAACAACCGCTGGTACCTGGCGGTGGCCGCCACCTACGTGCTGCCTGAGCCGGAGACGGCGAGCCGCTGCAACCCCGCG 
GCATCCGACCACGACACGGCCATCGCGCTCMGGACACGGAGGGGCGCAGCCTGGCCACGCAGGAGCTGGGGCGCC 
TGCGAGGGCGCGGGCAGCCTGCACTTCGTGGACGCCTTTCTCTGGMCGGCAGCATCTACTTCCCCTACTA 
ACGAGCGGCGCTGCCACCGGCTGGCCCAGCATGGCGCGCATCGCGCAGAGCACCGAGGTGCTGTTCCA^ 
5 GACTGCGGCCACGGCCACCCCGACGGCCGCCGCCTGCTCCTCTCCTCCAGCCTAGTGGAGGCCCTGGACGTC 

TTCAGCGCGGCCGCTGGAGAGGGCCAGGAGCGGCGCTCCCCCACCACCACGGCGCTCTGCCTCTTCAGAATGAGTGAGATCCAG 
GCGCGCGCCMGAGGGTCAGCTGGGACTTCAAGACGGCCGAGAGCCACTGCAMGAAGGGGATCMCCTGAM 
ATCGCATCATCTACCTTGATCCATTCCGACCTGACATCCGTTTATGGCACCGTGGTAATGMCAGGAC 
ACTGGAGATGGCCAGTTACTTMGGTTATTCTTGGTGA 
10 GAGACACCTGTTTTCTACAMCTCGTTCCTGATCCTG 

ATTCGTGTTGCAMCTGCMTAMCATAMTCCTGTTCGGAGTGTTTMCAGCC 
CTACAMGGTGCACTTTTCMGGAGATOGTGTACATTCAGAGMCTTAGAAAA 

TGCCCTAAMTTCAGATAATTCGMGCAGTAMGAAMGACTACAGTGACTATGGTGGGAAGCTTCTCTCCMGAC 
TGCATGGTGMGMTGTGGACTCTAGCAGGGAGCTCTGCCAGMTAAMGTCAGCCCMCCGGACCTGCACCTG^ 
15 ACCAGAGCAACCTACAMGATGTTTCAGTTGTCMCGTGATGTTCTCCTTC 
ACCAACTGCTCATCATTAAMGMTGCCCAGCATGCGTAGAMCTO 

CCCTTCACAGCTTGCGACCCTTCTGATTATGAGAGAMCCAGGMCAGTGTCCAGTGGCTGTCGAGMGACATCAGGAGGAGGA 

AGACCCMGGAGMCMGGGGMCAGMCCMCCAGGCTTTACAGGTCTTCTACATTMGTCCATTC^ 

ACATOGGGAAMGCMCGTGATAGTMCGGGAGCAAACTTTACCCGGGCA 

20 acctgtgatmggatgtgatacaggttagccatgtot^ 

atgaaggatgtgtgtatccagtttgatggtgggaactg^ 

cttatatttcctgctaccacctggatcagtggtggtcaaaatatmccatc 

atcatttcacatgaattaaaaggamcatamtgtot 

mgagttcaamgtgcgcacgmtgtcactgtgmgctgagagtacm 
25 gaggaccccagattcacggggtatcgggtggmtccgaggtggacacagaact^ 

ttcmtatttccaaaamgacattgamttactctctto 

agamtcmgatcttaccaccatcctttgcaa^ 

CGGGTCMGCTGGGAMCCTGGAGCTCTACGTCGAGCAGGAGTCAGTTCCTTCCACATGGTATTT 

ttgctagtgattgtcatttttgcggccgtgggggtgaccaggcacam 
30 gmttgctggamgcgagctccggamgagatacgtgacggctttgctgagctgcagatggataaattc 

tttggaactgttcccttccttgactacamcattto 

actgmgatatgcatmcagagacgccmcgacmgmtgamgtctcacagcct^ 

cttgttactgtcatccacacccttgaamgcagmgmcttttctgtgmggacaggtgtc 

gcactgcamccmgctggtctacctgaccagcatcctagaggtgctgaccag^ 
35 ccgamctcatgctgagacgcacggagtccgtcgtcgaaamctcc 

cgggagactgtcggagagcccttctatttgctggtgacgactctgmccagaamttaacaagggtc 

tgcamgccctgtacacacttmtgmgactggctgttgtggcag&ttc 

gaaaamtcccggaamcgagagtgcagatgtctgtcggmtatttcagtcmtgttc 

amgaaaagattttccmgcattcttmgcaaamtggctctc 
40 atgggcacacgacagamgmcttctggacatcgacagttcctccgtg^ 

ggccactatgagatatcamtggatccactataamgtctttaagmgatagcamttttacttcagatgtggagta 

gaccactgccatttgattttaccagattcggmgcattccmgatgtgcmggaaagagacatc 

gtaamgamtgtatctgacamgctgctgtcgaccmggtggcmttcattctgtgc 

agtttacccaacagcagagctccatttgctataamtactttm 
45 gatcctgacgtcgtacatatttggaamcamcagccttcctcttcgcttctc 

tttgacattaagaagacacgacatatagacggctgtttgtcagtgattgc 

cagcmctagggmggmgcaccmctmtmgcttctctatgccmggatatcccmcctacaaagaagaagtaaaatcttat 
tacamgcmtcagggatttgcctccattgtcatcctcag 
gaatttmtgaagaagtggccttgacagamtttacamtacatc 
50 gmcgago^tggmgmgctcagamcaactcttgcatgtaamgtc 
gcactctcgggcctggcttmtctggcamgttc 
atttgttgtttgcacataggttccactttgggcactgtctc 
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ACGCGTCCGMGACATTMGTAAAAMTTGG 

TGGCATATMTCTGGAGTTCTTrrCACAACCTTACTTTTTCTGATTTGCT^ 
MTATGTTGTAMTTCTCCCTTGGCMGATTTCTC^^ 
5 MGGTGCCATGATGTATTCTTGGGTGCATTGGTTTTTTGCGCATTGTAAA 
GATGAGTTTCAGMCCTTTTACGTTCTCGGTAGAGG^ 
TTCAMTTCCAGTMGGCATAGCACTTOTMGAMTTAGM 
TATCTTATGTTACACTGGGAGTMTTTGAGGTGCMTTATTTTTATTACT 
AGAAMCTMGMGTMGTGTAACTTTTAMGTMGTATATATCA 
10 AGTTGTGTTTTCATGTCTCATCAAMGACMT 

CATAGGAAMTAGATATTTCCTAGATGATTTC^ 

mtagggatmtattgatatatctgttgcta^ 
tttgcccagtgtat7gaattggtggtagmggtagttccatc 
atagmtgtggctcagttctggtccttcmgcctgtatggtttggattttcagtaggggacag 
15 tggtacacaggaagctttataamtttcattcacgmtct^ 
mtmggmctaaagctttatatattgat^ 
ttgtacmamtgtattcaatagaaaatcatgatttattaata 

67 

MGRGWGFLFGLLGAVWLLSSGHGEEQPPET^ 
20 CPFWNDISQCGRRDCAVKPCQSDEVPIXUKSASm 

DDIQSPEAEYVDLLLNPERYTGYKGPDAWK IWNVI YEENCFK PQTIKRPLNPLASGQGTSEENTFY SWLEGLCVEKRAFYRL I S 
GLHASINVHLSARYLLQETWLEKWGHNITEFQQRFrc^ 
DEENKMLLLEILHEIKSFPLHFDENSFFAGDKKEAHKLKEDFRL^ 
IANMPESGPSYEFHLTRQEIVSLFNAFGRISTSVKELENFRNLLQNIH 

25 68 

GCACGAGCCCCGGGCTGCCGGCGCGGGCGCCGCGGCACGTCCACAGGCTGGGTCGCGAGGTGGCGATCGCTGAGAGGCAGGAGG 

GCCGAGGCGGGCCTGGGAGGCGjGCCCGGAGGTGGGGCGCCGCTGGGGCCGGCCCGCACGGGCTTCATCTGAGGGCGCACGGCCC 

GCGACCGAGCGTGCGGACTGGCCTCCCAAGCGTGGGGCGACAAGCTGCCGGAGCTGCAATGGGCCGCGGCTGGGGATTCTTGTT 

TGGCCTCCTGGGCGCCGTGTGGCTGCTCAGCTCGGGCCACGGAGAGGAGCAGCCCCCGGAGACAGCGGCACA 
30 CCAGGTTAGTGGTTACTTGGATGATTGTACCTGTGATGTTGAMCCATTGATAGA 

ACAAAMCTTCTTGAMGTGACTACTTTAGGTATTACMGGTAMCCTGMGAGGCCGTGTCCTTTC 

GTGTGGMGMGGGACTGTGCTGTCAMCCATGTCMTCTGATGAAGTTCCTGATGGM 

TGAAGMGCC^TMTCTCATTGAAGAATGTGAACMGC^ 

GMGGCTGTTCTTCAGTGGACCMGCATGATGATTCTTCAGATMCTTCTGTC 
35 ATATGTAGATTTGCTTCTTAATCCTGAGCGCTACACTGGTTACAAGGGACCAGATGCTTGGAA 

AGAAAACTGTTTTMGCCACAGACAATTAAMGAC^ 

TTACAGTTGGCTAGMGGTCTCTGTGTAGAAAAMGAGCATTCTACAGACT 

TTTGAGTGCMGATATCTTTTACMGAGACCTGGTTAGAAAAGAAATGGGGACACMC 

TGGMTTTTGACTGMGGAGMGGTCCMGMGGCTTMGAACTTCTATTTTCTC 
40 AGTGTTACCATTCTTCGAGCGCCCAGATTTTCMCTC 
. GGAMTACTTCATGAMTCAAGTCATTTOT 

ACTAAAGGAGGACTTTCGACTGCATTTTAGAAATATTO 

AMGCTTCAGACTCAGGGTTTGGGCACTGCTCTGMGATCTTATTTTCTC 

TAGTTATGMTTCCATCTMCCAGACMGAMTAGTATCATTAT 
45 AGAAMCTTCAGGAACTTGTTACAGMTATTCATTAMGAAMCMGCTGATATGTGCCT 

AGTGGMTTTCATTCAMGGCATMTAGC^^ 

ATATTGTTTTMGTAMCACATTTTTAAAMTTGTGTTMG 

TGTGGTACAMTTTTAMGTTTMTATTGA 

TCAMCTAGCGTTTTATGTMTAATATGTMTATAM^^ 
50 AGGCCTCATAMTGACCCATMmCCAACGTAGMTlTrTCAGTACATTTAGG 
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TTGCCCAGTTAGATTTGAATTC 

AATTATTOTTTGAMTTCACATTTMCTGGGA 

CTTTTAGAAGTATATTGCTATTTTGATTGG 

TAMCATTGTGAAMGTTGMTAGATTATATATTTATTCTCATAATACTTTC 
5 TTATTTTTATATMTTTCCMTTTACAGAAAAGTTTC 
ACGTGCTTTATCTTTCATGCTTTCTCTGTACACA 
ATCATGCCCCTTAMCCCTAMTACTTCAGTGTGTMTACT^ 
CAATTCTGGMCTATAATACTGTMGCCm 

CCGTTTTATGCTTCATGTGTATAGTMTAGGGTAGGCTATTTATTTTAT^ 
10 GCCCMGCTGGAACTTGMCGACTGGGCTGAAGTGATCTTCCCACCTCAGCCTCCCMGTAGCTGGGM 
CCATACCCAGTTTCATTTTTGTTTTTTATACCCGMGTTCATTTCCTTTC 
TTCATTAGTGGMGCTCTTCATTTATAMGCTATTTGM 
GCTATATACCTCAAAATCGTGCCCTCm^ 

GTTTATTTATATAGTGGTTMGAMTGTCAGTTTACACTGCTGTATACTTCTTCTTCTG 
1 5 AAGCACATACTTGGTATCCMTAMTATTTGTTGGATGAA 

70 

GGCACGAGGAGATGGCGGCAGCGGCGCTGGGGAGGGCGAGGCGGAGGCGGCAAMCGGGCGGTCGAGCAGMC 
TCCCCTCCAGTCCGCTCCGGGCAGCTGCTGATGCAAGGMTCCCCTG^ 

CTGTGCTGAGTCTTCCTGCAGGCCTTTCCTTGCCTCTGTGGGACCCTGTGGGGGTCCATCCGGCTGGAGAAGA 
20 TGCTAACGTTGCAGACCCCAGAGGGTCCTGTGTGGGTGTG^ 

CCTGCTGAGTCACTCCCCAGGTACCTCCMTCAGAGCCAGCCCTGTTCTCCAAAGCCAATCCGCCTGGTTCAGGAC 
' GGAGCTGGTGCATGCAGGCTGGGAGMGTGCTGGAGCCGGAGGGAGMTC 

CCTGTGGGAGATGCCCGTGCTGGGGCAGCACGATGTGATTTCGGACCCTTTGG 

MGCTTGGTGGAMCTCCCCCGGCTGAGMCMGCCCAGAAAGCGGCAGCOT 
25 GMGCCCAAGATTGAMTCCCAGTGACACCCACAGGCCAGTCGGTGCCCAGCTCCCCCAGTATCCCAGGAACCCCAACGCTGAA 

GATGTGGGGTACGTCCCCTGMGATAMCAGCAGGCAGCTCTCCTACGACCCACTGAGGTCTACTGGGACCTGGACATC 

cmtgctgtcatcmgcaccgggggccttcagaggtgct(£ccccg^^ 

gmgcttcggcagcactatcgggagctgtgccagcagcgagagggcattga 

ggagcgcmggtggtagacaaaggatctgaccccctgttgcccagcmctgtgmccagtc 

30 mtcatgmcgacattcctatcaggttatccc^ 

cgccaggcggctcatcgagtccaggagtgcatcccctgacagtaggmggtggtcamtggmto 
gcttcggaaggaccactcagcctccaaggaggactacatggatcgcctggagcatctgcggaggcagtgtggcccccacgtctc 
ggccgcagccmggactccgtggmggcatctgcagtaagatctaccacatctccctggagtacgtcaaacggatccgagagaa 
gcaccttgccatcctcmggaamcmcatctcagaggaggtggaggcccctgaggtggagc 

35 agtccggctggctgtgtctgcaccgcccatgcccagcgtggagatgcacatggagmcmcgtggtctgcatccgg 
agagatggtcmggtcagccgcmctacttcagcmgctgtggctccm 
gaggttcctgccccgggtctggtgtcttctccgacggtaccagatgatgttcggcgtgggc 
gggatcgctgcctgtgcatgtctttgaggccctccaccgactctttggcgtcagcttcgagtgcttcgcctcacccctcmc^ 
ctacttccgccagtactgttctgccttccccgacacagacggctactttggctcccgcgggccctgcctagac 

40 gagtggttcatttgaggccmccctcccttctgggaggagctcatggatgccatg 
ctcaccggagcccctgtccttcatcgtgttcatccctgagtgg03ggmcccccmc 
ccgcttcamcgccaccagttgatcctgcctgcctttgaggatgagtaccgcagtggctccc^ 
aatgcactacmggccgtccacmcacggctgtgctcttcctacagaacgaccctggct™ 
acggctgcaggagctgagtgctgcctaccggcagtcaggccgcagccacagctctggttcttc 

45 caaggaccgggactcgggccgtgagcagggtcctagccgcgagcctcaccccacttmcatatc 
aggggtgctagtctggactgctgggactcgggcccctggggcctcagagggacccc 

TTCTGCCAGGGCTCCCCTCCCTGCCTGTCCCCMGTCCTCACCTCAMCTCCCTCCMGTC 

TTCCCMCCCCGCCCCTCACCCTGTTCXXACCTTGTTTC^ 

AAAAAAAAAAAA 



50 72 
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TATCATGCATGTGGGMCX3TGGGTGTGGTGAG 
GGAAMCTGGCAGTATTTTGAACTGCATCTO 

GCGCAGTAGCAAAGATGGTCMGGCCATGGAAAMGCAGAMTTTACCMG^ 
TCAMTACATCTGCTATCTTTTTMGCTMGTCCTAGACATATC 
5 MGCAGAGAMTGTMTTCCATATTTTATTTGAMCTTATTCCATATTW 
CACAMCTTTMTTTTGTTAMT^ 
TATTTTACCATCTTTMTCATCTTGGAAM 

AGGCTTTAMTTATMCATATGGGATTCTTAGTAGTATGTTTTTTTCTTGAAA 

TCACTCTTTCTCTAAGACTAMCTCTAGGCTCTT^ 
1 0 MTATCTTTTMTAGTMCATGTATTTTATGGACCAM 

CACACTGAGTTGGGMTTTCTTATCCCAGMGACCMCCAATTTCATATTTATTTMG 

AGMTCCCTGCAGTCTCCTTAMGGTAGMCAMTACTTTCTATTTTTTTTW 

CTCTAAAAAAACAGAGAACAAATATGTCTCAGTO^ 

TGTGCACCTTTTGGCMCTTCTCTTTTCMTGTAGGGAAAMCTTAGTC 
15 ATGTGGGCAGMGGTTTGGGGGTGGACATTGTATGTGTTTAAATTAMCCCTGTATCACTC 

AAMTGMTGCTTAGMGCTGTTCACATCrTCMGAGCAGMGCAMC^ 

CATAMGTGAATCATTTCTTCTGTATTMTTO 

TGC&TTGATATTTTTCm 
• GTTCTGTGGTTGATGTTGTTTGTTTC^ 
20 CATACTACATGCAGTTCTTTMCCAATGTCTGTTTGGCTMTGTMTTAMGTTG 

TCMTGGTTTCTTMTATTTATTGTGTAGM 

GTTTTCATGTGTTTATAGCAGMGmTTTATTTCTATGGCATTC 

TTTTGTACAGTTAGTGGACAGTATTCAGCMCGCCTGATAGCTTCTTTGGCCTTATGT^ 

73 

25 mreivhiqagqcgnqigakfwevisdehgidpsg^^ 
frpdnfifc^sgagnnwakghytegae^^ 

wps pkvsdtwepynatls ihqlventdetyc idnealydicfrtlklatptygdlwhlvsatmsgvttslrfpgqlnadlrk 
lavnwpfprlhffmpgfapltrrgsqqyraltvpel^ 
nssyfvewipmwkvavcdipprglkmsstfignsm 
30 eyqqyqdataeeegemyeddeeeseaqgpk 

atgcgggagatcgtgcacatccaggccggccagtgcggcmccagatcggggccmgttctgggmgtc 

ggcatcgaccccagcggcmctacgtgggcgactcggacttgcagctggagcggatcagcgtct 

cacmgtacgtgcctcgagccattctggtggacctggmcccggmccatggacagtgtccgctcaggggcctt^ 

35 ttcaggcctgacmtttcatctttggtcagagtggggccggcmcmctgggccm 
gtggattcggtcctggatgtggtgcggmggagtgtgaamctgcgactgc^ 
ggggggacgggctccggcatgggcacgttgctcatcagcmggtgcgtgaggagtatcccgaccgcatc 
gtcgtgccctcacccmggtgtcagacacggtggtcxsmccctacmc^ 
gatgamcctactgcatcgacmcgaggcgctctacgacatctgcttccgcaccctcmgctggcc 

40 ctcmccacctggtatcggccaccatgagcggagtcaccacctccttgcgcttcccgggccagctcaacgctgacctgcgcaag 
ctggccgtcaacatggtgccctocccgcgcctgcacttcrrcatgcccggcttcgcccccc 
taccgggcotoaccgtgcccgagcteacccag^ 
cgctacctgacggtggccaccgtgttccggggccgcatgtccatgmggaggtggacg 
mcagcagctacttcgtggagtggatccccmcaacgtgmggtggccgtgtgtgacatcccgcccc^ 

45 tccaccttcatcgggmcagcacggccatccaggagctgttc 

gccttcctgcactggtacacgggcgagggcatggacgagatggagto 
gagtaccagcagtaccaggacgccacggco^ggmgagggcgagatgtacgmgacgacgaggagga 
cccaagtgaaactgctcgcagctggagtgagaggcaggtggcggccggggccgaagccagcagtg^ 
cttgctgccgacaccctgctttccccatcgccctagggctcccttgccgccctc 
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CCTAGGCCACGTGTGAGCTGCTCCTGTCTCTGT 
GTGTTTATATTTTCGGGGATACTTMTAM 

MAEGETESPGPKKCGPYISSVTSQSVNLMIRGWLFFIGVFLALVL^ 
5 GLLYPCIDRHLGEPHKFKREWSSVMRmVFVG^ 

TQLLVYNGVYQYTSPDFLYVRSWLPCIFFAGGITMGNIGRQLAMYECKVIAEmilRNEEGKKYLLYRKAR 

76 

ACTCACTATAGGGCTCGAGCGGCCGCCCGGGCAGGTGGGATTTCGG^ 

GCGTCAGTCTTTGATTCACAGACAGTTGAGCTTTTCAGCTGGGMGCCT^ . 
10 ATGAMCCATGGCAGMGGAGAGACAGAGTCACCTGGGCCCAAAMGTGTGGCCCATATATTTCATCT 

TGAACTTGATGATTCGAGGAGTAGTGCTATTTTTTATTGGAGTATTTC 

ATGTGACGCTCTTTCCACCTGATGTGATTGCMGCATC 

CTGTGATTGGGTTATTATACCCCTGCATTGACAGACATCTAGGAGM 

GGTGTGTAGCAGTCmGTTGGTATAMTCATGCCAGTGCTAMGTGGATTTCGAT^ 
15 CTGCACTATCCATTGGACTGTGGTGGACTTTTGATAGATCTAGM 

CTGTGGTCACTCAACTGCTAGTATATMTGGTGTTTACCMTATACATCTCCAGAOT 

TCATCAGGMTGMGMGGCAAAAMTATCTTTTGTACAGAAAAGCMGATGAAMGGATGTGAAATG^ 
AACTTCAGACTGTAAAATTK^ 
20 ACACATATTACTGCAATCTGTGATTGCTTCATCTGTAAATCAGTTC 
TATATGTACTACATTAAAMGTGTTGATTMTAGATGAMTT^ 
TGTTMTGTGCCMGATATTGTTCCTGTCATG 

AGAGGGAMGGCCAAAMCAMCAMCAAAAGGCATATGGGGAGCTG^TATm 
ATTTTTCCTMTCCACGCCGGTATGGGGTTTGGGGGGCCCCMTTGTTGG 

25 77 

MSKLLRAVILGPPGSGKGTVCQRIAQNFGLQHLSSGHFLRENIKASTEVGEMAKQYIEKSLLVPDHVITRLMMSELENRRGQH 
WLLDGFPRTLGQAEALDKICEVDLVI SLNI PFETLKDRLSRRWIHPPSGRVYNLDFNPPHVHGI DDVTGEPLVQQEDDKPEAVA 
ARLRQYKDVAKPVIELYKSRGVLHQFSGTETNKIWPYVYTLFSNKITPIQSKEAY 

78 

30 CGGCGCTGGGCTGAGGGGAGGGGTTGTCTTAAAAGTCTCTCCTTCCCCCTGTAGGGGCGGCCGGCGAGTCCCAGTGAGAGCGGA 
GGGTGCCAGAGGTAGGGGGCCGAGAMCAMGTTCCCGGGGCTTCCTCCGGGGCCGCGGTCGGGGCTGCGCGTTTGACCGCC 

cctcctcgcgmgcaatggcttccamctcctgcgcgcggtcatcctcgggccgcccggctcgggcm 
aggatcgcccagmctttggtctccagcatctcto 

gagatggcaaagcagtatatagagaamgtcttttggttccagaccatgtgatcacacgcctmtga 
35 aggcgtggacagcactggctccttgatggttttcctaggac 

CTAGTGATCAGTTTGMTATTCCATTTGAAACACTTAAAGATCGTCTCAGCCGCCGTTC 
TATMCCTGGACTTCAATCCACCTCATGTACATGGTATTGA 

CCCGMGCAGTTGCTGCCAGGCTMGACAGTACAMGACGTGGCAMGCCAGTCATTGMTTATACM 
CACCMTTTTCCGGMCGGAGACGMCAAMTCTGGCCCTACGTTTACACACTTTTC 
40 AMGAAGCATATOACCCTGCCCMTGGAAGMCCAGGMGATGTGGTCATTCATTCMTAG 
TCCAAATTAGMGCTAGCTGAGGTAGCTTGCAGCATCTTTTCTAGTTGAM 
GAGTTCATGMGAGGCCCTCCTCTGCCTTTC 

CATCACMGAMGCMGTACAGTGTGGATTTCAMTGGTGTGTMCTTC 
MTATTTTTGACATGTGATGGTGATAGTCTCTGGTTCTCCCCATCCCC^ 
45 GAATGMTCCTGAGGGCTCTAGCCCAGGCITTC 
TTACTCATAGTGGTTGTTTCTCTGGTCTTC 
TCCACGCTCCATAGAGTCTCTCCTTTTC^^ 
GTTTTCACTCTTAGGCTTTTAAACMTAGTTATTGCTTTTArc 
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MTCTCTGTATGCACTGAGAACTGAGCTATGMGAG 
TCTTTTATTGTGAAAAAAAAAAAAAAA 

79 

MGSVMSAGVPRGCDILIVYSPDAEEWCQYLQTLFLSSRQVRSQKILTHRLGPEASFSAEDLSLFLSTRCWVLLSAELVQHFH 
5 KPSLLPLLQRAFHPPHRVITCLLCGVRDSEEFLDFFPDWAHW^^ 

- KQQNLPTVTS PGNLMWQPDRI RCGAETTVYVI VRCKLDDRVGTEAEF S PEDS PSVRMEAKVENEYTI SVKAPHLS SGNVSLKI 
YSGDLWCETVISYYTDMEEIGNLLSNAANPVEFMCQAFKIVPYOT 

DEELPTLLHFMKYGLKNLTALLLTCPGALQAYSVANKHGHYPOTIAEKHGFRDLRQFIDEYVETTO 
AVYESMAHLSTDLU4KCSLNPGCDEDLYESMAAFVPMTEELYVE^^ 
10 RDQCHLGQEEDVYHTVDDDEAFSVDLASRPPV^^ 

WRDRPQSSIYDPFAGMTPGQRQLITLQEQVKLGIVNVDEAVLHFKEWQLNQKRRSESFRFQQENLKRLRDSITRRQREKQKSG 
KQTDLEITVPIRHSQHLPAKVEFGVYESGPRKSVIPPRTELRRGDWKTDSTSSTASSTSNRSSTRSLLSVSSGMEGDNEDNEVP 
EVTRSRSPGPPQVDGTPTT4SLERPPRVPPRAASQRPPTRETFHPPPPVPPRGR ■ 

80 

15 AGACCCCCCAGGGTGCCTCCGAGAGCTGCCTCACAGAGGACTCCGACCAGGGAGACCTTCCATCCTCCTCCACCTGTTCCACCC 
AGAGGACGCTGATTCCACCTCCTAAMCCTGCCTACTTCAGGACTT^^ 
TGTTGAGTTTTATAGCATGACTGTTGACCTTAAGATCCATTCTCATTGCTGATM 

CGMGATTTTATTMGGCACGMGMGTGAAAMCTAAGGGCTTCATTCACCATCACCMGTATATCGAACCATATACTTGTTT 
GCCAAMGGGTGGMGGACTTTMTTCGGAMTACTTTACCTCTTMTTTGGCCATATTCAG 
20 TMTGTTATTTCACCGTGGTTTTTATTGAATGG 

82 

GGCTGGCTATATATGCATTGATGATTGTCCTATTATATTAGTTATAAAGTGTCATTTAATATG 
GGAMGAGTAGTTGAAMCATAMCATTTGGGCCTTTCAAG 
25 GTCAGGG 

84 

GCTTGAGATCTCCAGCCTTACCGCGGCTCGAMTGGCCCCMCTGCTCCTGCACCACTGGTGTCTCCTGCANCTG 
CTGCAAGTGCAAAGAGTGCAMTGACCTCCTGCMGAAGAGCTGCTGNNCTGCTGCCCCGTGGTGTC 
GTCTGCAAAGGGACGTTGGAGMCTGCAGCTGCTGTNCIX5ATGTGGGMCMGCTCTTC 
30 CACMCCTGGGATTTTTTTTCAATACGATACTTGAGCCATTTTCTGCATTTC 
ACAATTTTGACTTTGAATCTTAAAAA 

85 

MPLGHIMRLDLEKIALEYIVPCLHEVGFCYTjDNFLG^ 

EEGCEAISFLLSLIDRLVLYCGSRI^KYYVKERSKAMVACYPGNGTGYVRHVDNPNGDG 
35 PEGKSFIADVEPIFDRLLFFWSDRRNPHEVQPSYATRYAMTVWFDAEERAEAKKKFRNLTRKTESALTED- 

86 

GAGTCTGGCCGCAGTCGCGGCAGTGGTGGCTTCCCATCCCCAAMGGCGCCCTCCGACTCCTT^ 

CAGTCCGGAMCGGGTCGTGGAGCTCCGCACCACTCC^^ 

TTTCCCTTGTCCCCGACGCTGCAGCGGCTCGGGTACCGTGGCAGCCGCAGGTTTCT^ 
40 GGCTTCGCGCTCGTGTAGATCXSTTCCCTCTCTGGTTGCACGCTGGGGATC 

GACACATCATGAGGCTGGACCTGGAGAAMTTGCCCTGGAGTACATCGTGCCCTGTC 

ACMCTTCCTGGGCGAGGTGGTGGGCGACTGCGTCCTGGAGCGCGTCMGCAGCTC 

AGCTGGCGG^^CGCGCGCCGGCGTCTCCMGCGACACCTGCGGGGCGACCAGATCACGTGGATC 

GCGAGGCCATCAGCTTCCTCCTGTCCCTCATO 
45 AGAGGTCTAAGGCMTGGTGGCTTGCTATCCGGGAMTGGMCAGGTTATGTTC 

GCTGCATCACCTGCATCTACTATCTGMCMGMTTGGGATC 
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MTCATTCATAGCAGATGTGGAGCCCATTTTTGACAGACTCCTGTTCTTCTC 
CCTCTTACGCAACCAGATATGCTATGACTGTCTGGTAC 
TMCTAGGAAMCTGMTCTGCCCTCACTGMGACTGACCGT^^ 
TGMTTCTCTTAAATTCTTTGAGATCCAMGATGGCCTCTTCAGTGACM 
5 CCTGTCTTGTGTGTGGTACTTCATGTTTTCTTGCCAAG 

CTMCTCCAGAMTTCCTGCAGACATCCTACTCGGCCAGCGGTTTACCTGATAGATTC 
GGAGCACAGCGAGGGMTGAACCTTACTTGCACTTTATGTATACTTCCTGATTTGAM 
GAGGTGGTAGATGCCACAGAGAGGCATCACGGMGCCTTMCAGCAGGAAACAGAGAAA 
ATGTTCTTMTCCAGGGCTGTTGGGGTTTCTGGAGMTTATCACM 
10 TAGTGMCMTTTATMGTGTCCCATGGGGCAGACACT 
TTTGCTGAAMTAATGACTTTCTGMTAMGATGGCMC^ 
CCAGMGCTAAAAMCTAGACATTAGTTGTTTTGGTTGCTT^ 

CCTGGTAGTTTTGTGTTMCCACTGATAACTGTGGAAAGAGCTAGGTCTACTGATATACMTAMCATG 

1 5 GAGCTCACCCTTTMCACAAAAAMGCAGGGTGATGTATTTTAAAAMGGMGTC^ 

TTCTCGTCTGTCCCTAGCAGTCTTTCTTCAGCTCACTTGGCTCTCTAGATCCACTG^ 

ACTTGCTAGMCTGTGGMGCTTCTACTCCTGCAGTMGCACAGATCGCACTGCCTCAATMC™ 

GCAAMGCTACTTTTCCTAGTTTTCAG 

TTAMGAGCCAGAMTATTTTCCTTTGTTATGTACGGATATATATA^ 
20 CTTCTGGTTTTGGAAMTTTCCAGATMGACATGTCACCATTMTTCTCMCGACTGCTCTA 
TCACCTTCTAMTTACTATGTMTTTACTCACTT^ 
ACAGGMTTTTGTATATTTTTMCCAATGCMCATACTCTCAGCA 

TACTTGTTGAATAMCTCAGTCTCCTACATTAGCATTCTAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
87 

25 merrrlwgsiqsryismsvwtsprrlvelagqsllkd^ 
vlmk<^hlhletfkavlix3ld^ 

eqpfipvevlvdlflkegacdelfsyliekvkrk™?lrlcckklkifmpmqdikmilkmvqldsiedle^ 
spyi£qminlrrlllshihassyispekeeqyiaqftsqflslqc^^ 

gdvmhlsqspsvsqlsvlslsgmmspeplqallerasatlqdlwdecgitddqllallpslshcsqlmsfygnsisi 
30 salqsllqhliglsnlthvlypvplesyedihgtlhlerlay^ 

CFMPN 
88, 

gcttcagggtacagctcccccgcagccagmgccgggcctgcagcccctcagcaccgctccgggacaccccacccgcttcccag 

gcgtgacctgtcaacagcmcttcgcggtgtggtgmctctctgaggaaaaaccattttga™ 
35 cmcmgtgactgagacctagamtccmgcgitcg 

gtggggttccattcagagccgatacatcagcatgagtgtgtggacmgcc^ 

gctgaaggatgaggccctggccattgccgccctggagttgc^ 

cgggagacacagccagaccctgmggcaatggtgcaggcctgg^^^ 

acatcttcacctggagaccttcamgctgtgcttga 
40 acttcmgtgctggatttacggmgaactctcatcaggacttctggactgtat^ 

tccagagccagmgcagctcagcccatgacamgmgcgaamgtagatggtto^ 

agtagaggtgctcgtagacctgttcctcmggaaggtgcctgtgatgmttgt^ 

gaaamtgtactacgcctgtgctgtmgmgctgmgatttttgcmtgcccatgcagga 

gcagctggactctattgmgatttggmgtgacttgtacctggmgctac 
45 gatgattmtctgcgtagactcctcctctcccacatccatgcatcttcctacatt^ 

ccagttcacctctcagttcctcagtctc^^ 

TCAGTTGCTCAGGCACGTGATCMCCCCTTGGAMCCCTCTCMT 

GTCCCAGAGTCCCAGCGTCAGTCAGCTMGTGTCCmGTCTMGTGGGGTCATGCTGACCGATGTAAGTC 
AGCTCTGCTGGAGAGAGCCTCTGCCACCCTCC^GGACCTGGTCm^ 
50 CCTGCCTTCCCTGAGCCACTGCTCCCAGCmCMOT 
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i 

CCTGCAGCACCTCATCGGGCTGAGCAATCTGACCCACGTGCTC 

CCTCCACCTGGAGAGGCTTGCCTATCTGCATGCCAGGCTCAGGGAGTTGCTGTC 

TAGTGCCAACCCCTCTCCTCACTGTGGGGACAGAAC^ 

GCTGGGTGCACATATCAMTGCTTCATTCTGCATACTTGGACACTAMGCCAGGATC 
5 CCACAGTTTCAGACAMTGTTCAGTGTGAGTGAGGAJ^ 

AAAAMGGGGMGTTGGGGATAGGCAGATGTTGACTTGAGGAGTTMTGTGATC 

ATAGMTCTGMTTTCTAMGGGAGATTCTGGCTT^ 

TGTTGAAAATAAAGAGAAGCAATGTGAAGCAAAAAAAAAAAAAAAAAA 

89 

10 MANDSGGPGGPSPSERDRQYCELCGKMENLLRCSRCRSSFYCCKEHQRQDWKKHKLVCQGSE 
PPI&GAREPRKAAARRDNASGDAAKGKVKAKPPADP& 
PGDALSPGGGLRPNGQTKPLPALKLALEYIVPCMNKHGICV^ 

GDKITWI EGKEPGCETIGLLMSSMDDL I RHCNGKIX3S YKINGRTKAMVACYPGNGTGYVRHVDNPNGDGRCVTCI YYLNKDWDA 
KVSGGILRIFPEGKAQFADIEPKFDRLLFFWSDRRNPHEVQPAYATRYAITV^ 
15 VGKDVF 

90 

TTAGGGGCAGAAAAACATTTGTMTMTTMTGGCTTTC 
• TAGTGCTCCTMTCATACMTATTMGGATTGGG 

GTAGMTMTAGTACCCACTTCATAGCATTGTATGATGATTAMTTGGTTAATATTTTO 
20 ACATMCAGCMGCACCACATGTGTTTATMGATAMTTCCTTTGTGTTGCCTTCCGTT 
ATAMTACTTGCATGACATTTTGMGTCTCTCTATMCATC 
CGTGTTGGTGACMGATTGTTCACCAGCATATGGTGTGGTGAAAACTC 
TAGGTGAAMTCCTGAMTCMGGATCTTTGGMCTATTTGAMTCAGTATTTT 

gcmgtgttctattosatggattmgtat^^ 
25 gctmtaaaggt!itrcatatggacatctattcttttmgtamcttcm 
tmtttgtggacacaccacagactgamtagcamt™ 
tagtaaagttcctgctcttgtgaatmttaagcctcatgtatm 

CTTGTATTTAMTTACATTTTACATAAT^ 

TTTATATTTTMGAATTTCTGMTTAATTTGCTTGGATm 
30 CCATTCCATCCTATTCATCCCTCTTTTAGGAAACTCTGAACTCTGG 

ATGTGTCTCTGAGTGTAGTATGACTMTTCATTTGTTTC^ 

CTGCMCMTGACATTTTACTGTATTT^ 

CAGAMCTTGGATTTTTAGTTCGGTTTGAMTTTGM 

ATTATCATCCTCTTTCACTTGATTGGATCCCCAGMTGCCATTTACGCATC 
35 TATTTGATGMGAGGTTTTATATm 

• TGCTTTTGAAATACATATTACMCTTTTTM 

CAAGCCTAGCACCCCACTTTCCACATAGTGTTCGAT^ 

TAGCAGCAGAMGAMCATCTTTGGCTTATTTCTGGTAAGGTTTTTATGCTCTGTAAM 

CACAGATTCTATTAAAAATAMTGTGAGAGTCGTTAATC 
40 CCATCAGTTTAAATTGGATATTGGMTGAGCATTGAT^ 

GGCGGGATTTCAAAGGTTTTAAAMTGAGTTTTTGATTTTT^ 

GGGGCGCMGAGATTGGATTMGACCATAGTMTACTTATTTTGTTCTTAACCA 

GGTGTMTGGAAAAMCAGCCTTGGMTCTGGGAGCCTGATTtt 

TTTACTTCTCTGMTTTCCGTT^ 
45 CACACCCCGCGCCGGCCTGGAGGMGAGCAGCCATGATTACGCCGCCTTCGCTCCGCTACCCGCTTGCGGCTGGCGCCCTC 

CAGCAGGTGTAGGCGCTGCCGCGCTGCCCCACGCCm 

CGGGCTGAGGTAGCGCACCGGCCTCTCGGCGTCCCAGTCCGGTCCCGGGCGGAGG^ 
AGCCGAGGCCCGGCCCCGCCGAGTGCGGAGGAGCGCAGGCAGCCCCCGCCCCTC^ 
TCCGCCCCCTCCGCCCTCGCGCGCCGCCCGCCCGGGTCGCCGCGGGGCCGTGGTGTACGT^ 
50 CCCGTATGCCCTGCGCTCCTCCACAGCCTGGGCCGGGCCGCCCGGGACGCTGAGGCGGCGGCGGCGGC 
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GCGCTCCCCAGGCCCGCGCGCCTGAGCCCAGGTTGCC^ 

GCTGAGGCGGCTGCCGGCGCTGCGCCGGAGCTTAGGACTCGGAAGCGGCCGGGCCG^ 

CGAGGGTAGCGGGTGCATCGCGCAGTMCGGCCCCTATCTCTCTCCCCGCTCCCCAGCCTCGGGCGAGGCCGTCCGGCCGCTAC 
CCCTCCTGCTCGGCCGCCGCAGTCGCCGTCGCCGCCGCCGCCGCCGCCATGGCCAATGACAGCGGCGGGCCCGGCGGG 
5 CCGAGCGAGCGAGACCGGCAGTACTGCGAGCTGTGCGGGMGATGGAGMCCTGCTGCGCTGCAGCCGCTGCCGCAGCTCCTTC 
TACTGCTGCAAGGAGCACCAGCGTCAGGACTGGAAGAAGCACAAGCTCGTGTGCCAGGGCAGCGAGGGCGCCCTCGGCCACGGA 
GTGGGCCCACACCAGCATTCCGGCCCCGCGCCGCCGGCTGCAGTGCCGCCGCCCAGGGCCGGGGCCCGGGAGCC 
GCGGCGCGCCGGGACAACGCCTCCGGGGACGCGGCCAAGGGAAAAGTAAAGGCCAAGCCCCCGGCCGACCCAGCGGCGGCCGCG 
TCGCCGTGTCGTGCGGCCGCCGGCGGCCAGGGCTCGGCGGTGGCTGCCGAAGCCGAGCCCGGCAAGGAGGAGCCGCCGGCCCGC 

10 TCATCGCTGTTCCAGGAGMGGCGMCCTGTACCCCCCAAGCAACACGCCCGGGGATGCGCTGAGCCCCGGCGGCGGCCTGCGG 
CCCMCGGGCAGACGMGCCCCTGCCGGCGCTGMGCTGGCGCTCGAGTACATCGTGCCGTGCATGAACAAGCACGGCATCTGT 
GTGGTGGACGACTTCCTCGGCAAGGAGACCGGACAGCAGATGGGCGACGAGGTGCGCGCCCTGCACGACACCGGGMGTTC 
GACGGGCAGCTGGTCAGCCAGMGAGTGACTCGTCCMGGACATCCGAGGCGATAAGATCACCTGGATCGAGGGCM 
GGCTGCGAMCCATTGGGCTGCTCATGAGCAGCATGGACGACCTGATACGCCACTC^ 

15 MTC&CCGGACGAAAGCCATGGTTGCTTC^ 

GGMGATGTGTGACATGTATATAmTCTTMTAMGACTGGGATGCCMGGTMGTGGAGGTATAC 
GGCAMGCCCAGTTTGCTGACATTGMCCCAMTTTGATAGACTGCTGTTTT^ 

CMCCAGCATATGCTACMGGTACGCMTMCTGTTTGGTATTTTGATGCAGATGAGAGAGCACGAGCTAMG 
ACAGGTGAAAAAGGTGTGAGGX5TTGMCTCMTAMCCTTCAG 
20 MTACCCCACTTCACCTACAATATTGTTMCTATTTGTTMCTTGTGAATACGAATAAATG 
GrrCGCAmTMTMGGAMCAGAMCAACTTTTTGTGTTGCATCAAACA^^ 

GATCMCTTCAMTCTGTGATTGCTTACAGGAGGMGATMGCTACTMTTGAAMTGGTTTTTACATCTGGATA 
TGCCCTGTGTAGMTTTTTTTCATTCTTATATTTTGCCAGATCTGTTATC 
TCMGTTTGMTTTGGGATMTTTTTCTATATO 
25 AAMTMCATCAATCTATTTTTGTAMCCTGTTCATACTATTAAATTTOCCCT 
TGACTGATGATTMTTTTATTmCTTAAAATAAGAAMGGAGCACmMTTAC 
TTCCTTACACTMTTTCMCTCTTAMGA 
CATTTACTACCMTMCTTTTAGTTAATGTTTTACAAGG 

91 . 

30 MLRTAMGLRSV^MPWGALPPRPPLLLLLLL^ 

GAREALFALSSNLSFLPGGEYQELLWGADAEKKQQCSFKGKDP^ 
ARDEKGIWLLEDGKGRCPFDPNFKSTALVVDGELYTGTVS 

LQGDDDKIYFFFSETGQEFEFFENTIVSRIARICKGDEGGER\njQQRVJTSFLKAQbLCSRPDIX5FPFNV^ 
mFYGVFTSQWHRGTTEGSAVCVFTMDVQRVFSGLYK^^ 
35 NFLKDHFLMDGQVRSRMLLLQPQARYQRVAVHRVPGLH^^ 

THRGLLYMSHSGWQVPMANCSLYRSCGDCLLARDPYCAWSGSSCKf^ 
VPTGEKPCEQVQFQPNTVNTLACPLLSNLATRLWL 

EWEDGVADQTDEGGSVPVI I STSRVSAPAGGKASVJGADRSYWKEFLVMCTLFVLAVLL PVLFLLYRHRNSMKVFLKQGECASV 
HPKTCPmPPETRPIiNGI^PPSTPLDHRGYQSLSDSPPGARWTESEKRPL'SIQDSFVEVSPVCPRPRVRLGSEIRDSW 

40 92 

GCTCTGCCCMGCCGAGGCTGCGGGGCCGGCGCCGGCGGGAGGACTGCGGTGCCCCGCGGAGGGGCTGAGTTTGCCAGGGCCCA 
CTTGACCCTGTTTCCCACCTCCCGCCCCCCAGGTCCGGAGGCGGGGGCCCCCGGGGCGACTCGGGGGCGGACCGCGGGGCGGAG 
CTGCCGCCCGTGAGTCCGGCCGAGCCACCTGAGCCCGAGCCGCGGGACACCGTCGCTCCTGCTCnCCGAATGCTGCGCACCGCG 
ATGGGCCTGAGGAGCTGGCTCGCCGCCCCATGGGGCGCGCTGCCGCCTCGGCCACCGCTGCTGCTC 

45 CTGCTGCAGCCGCCGCCTCCGACCTGGGCGCTCAGCCCCCGGATCAGCCTGCCTC 
TTCGAAGCTGMCACATCTCCAACTACACAGCCCTTCTGCTGAGCAGGGATGGCA 
CTCTTTGCACTCAGTAGCMCCTCAGCTTCCTGCCAGGCGGGGAGTACC^ 
CAGCAGTGCAGCTTCMGGGCMGGACCCACAGCGCGACTGTCAAMCTACATCMGATCC 
• CTGTTCACCTGTGGCACAGCAGCCTTCAGCCCCATGTGTACCTACATCM 

50 GGGMTGTCCTCCTGGMGATGGCMGGGCCGTTGTCCCTTCGAC 
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CTCTACACTGGMCAGTCAGCAGCTO 

AGCTCCCTCMCTGGCTGCMGACCCAGCTTTTGTGGCCTCAGCCTACATO 
GACMGATCTACTTTTTCTTCAGCGAGACTGGCCAGGMTTTC 
TGCMGGGCGATGAGGGTGGAGAGCGGGTGCTACAGCAGCGCTGGACCTCCTTCCTCM 
5 GACGATGGCTTCCCCTTCMCG'IKXITGCAGGATGTCTTCACGCTGAGCCC 
GGGGTCTTCACTTCCCAGTGGCACAGGGGAACTAC 

TTCAGCGGCCTCTACAAGGAGGTGMCCGTGAGACACAGCAGTGGTACACCGTGACCCACCCGGTC^ 

GCGTGCATCACCMCAGTGCCCGGGAMGGMGATCMCTCATCCCTC^ 

CACTTCCTGATGGACGGGCAGGTCCGAAGCCGCATGCTGCTGCTGCAGCCCCAGGCTCG 
10 GTCCCTGGCCTGCACCACACCTACGATGTCCTCTTCCTCGGCACTGGTGA 

CGGGTGCACATCATTGAGGAGCTGCAGATCTTCTCATCGGGACAGCCCGTGCAGM 
■ ■ CTGTATGCGGCCTCACACTCGGGCGTAGTCCAGGTGCCCATGGCCMCTGCAGCCTGTACAGGAGOT 

GCCCGGGACCCCTACTGTGCTTGGAGCGGCTCCAGCTGCMGCACGTCAGCCTCTACCAGCCTC 

ATCCAGGACATCGAGGGAGCCAGCGCCMGGACCTTTGCAGCGCGTCTTCGGTTGTGTCC 
15 MGCCATGTGAGCMGTCCAGTTCCAGCCCMCACAGTGMCACTTTGGCCTGCCCGCTCCTCTCCAACCTGG^ 

TGGCTACGCMCGGGGCCCCCGTCMTGCCTCGGCCTCCTGCCACGTGCT 

CAGCTGGGGGAGTTCCAGTGCTGGTCACTAGAGGAGGGCTTCCAGCAGC.TGGTAGCCAGCTACTGCCCA 

GGGGTGGCAGACCAMCAGATGAGGGTGGCAGTGTACCCGTCATTATCAGCACATCGCGTGTGAGTGCACCAGCTGGTGG 

GCCAGCTGGGGTGCAGACAGGTCCTACTGGAAGGAGTTCCTGGTGATGTGCACGCT^ 
20 TTATTCTTGCTCTACCGGCACCGGMCAGCATGAMGTCTTCCTGMGCA 

CCTGTGGTGCTGCCCCCTGAGACCCGCCCACTCAACGGCCTAGGGCCCCCTAGCACCCCGCTCGATCACCGAGGGTACCAGTCC 

CTGTCAGACAGCCCCCCGGGGTCCCGAGTCTTCACTGAGTCAGAGMGAGGCCACTCAG 

TCCCCAGTGTGCCCCCGGCCCCGGGTCCGCCTTGGCTCGGAGATCCGTGACTCTGTGG 

TGCCCTGGGTTCAGGGGCTGTGMTGOTCGGAGAGGGTCAACTGGACCTCCCCTCCGCTC 
25 GTGCCCGGCCCTTGGGAGCCTTGGGGCCAGCTGGCCTGCTGCTC 

ACAGCCGTGGCCCCAGAGGTCCTGGCCAMTATGGGGGCCTGCCTAGGTTGGTGGAACAGTGCTC 

TTGTTTAAAAMCMTTCCAAATGTGAAACTAGMTGAGAGGGAAGAGATAGC^ 

CATGGCCTCCCAGGGGTGCTGGGGATGCATCCAAAGTGGTTGTCTGAGACAGAGTTGG 

CTTCCACATTATCCCGCTGCCACCGGCTGCCCTGTCTCACTGCAGATO 
30 AGTCAGCCGAGGATGTAGTTGTTGCTGCCGTCGTCCCACCACCTCAGGGACCAGAGGGCTAGGTTG^ 

GGTCCTGGGCTCGGACCCMCTCCTGGACCTTTCCAGCCTGTATCAGG 

AGATTTCGTGACAATGTACGCCTTTCCCTCAGMTTCAGG 

GTGTGCCCCTTCCCACCATATCCACCCTCGCTCCATOT 

GTCCCCAGTTCACCCTCCATCCCTCACCTTCCTCCACTCTMGGGAT 
35 GGTTTTTATACATTTTTTMTAAGATGCACm 

92a 

GGCACGAGGCCGCTTGCGGGTGAGCTCTGCCCAAGCCGAGGCTGCGGGGCCC 

CGGCTCCGACGCCCCGCGGCGCCGGCGGGAGGACTGCGGTGCCCCGCGGAGGGGCTGAGTTTGCCAGCCGCGGCCGCCCTCCTC 
ACGCCACGGGGCGCCTCCCCGACTCAMCGGTGGGCCCACTTGACCCTGTTTCCCACCTCCCGCCCCCCAGGTCCGGAGGCGCC 

40 CGGGTGAACTGGGACAAAGGGTGGAGGGCGGGGGGTCCAGGCCTCCGCGGGGCCCCCGGGGCGACTCGGGGGCGGACCGCGGGG 
CGGAGCTGCCGCCCCCCCGGGGGCCCCGCTGAGCCCCCGCCTGGCGCCCCGCCTCGACGGCGG 
TGAGCCCGAGCCGCGGGACACCGTCGCTCCCACTCAGGCCGGCTCGGTGGACTCGGGCTCGGCGCCCTGTC 
CTCCGMTGCTGCGCACCGCGATGGGCCTGAGGAGCTGGCTCGCCGACGAGAGGCTT^ 
CGACCGAGCGGCCCCCATGGGGCGCGCTGCCGCCTCGGCCACCGCTGCTGCTGCTCCTGCTGG 

45 AGCCGGTGGCGACGACGACGAGGACGACCTGCTGCTCCTGCTGCAGCCGCCGCCTC^ 

GAGGACGACGTCGGCGGCGGAGGCTGGACCCGCGAGTCGGGGGCGATCAGGCTGCCT'CTGGGCTCTG 

AGATTCGMGCTAGTCGGACGGAGACCCGAGACTTCTCGCCGGTMGGAGTCTMGCTTCCTGMCACATC 

CCCTTCTGCTGAGCAGGGATGGCAGGGACTTGTGTAGAGGTTGATGTGTCGGGMGACGAC^ 

cgtgggtgctcgagaggccctctttgcactcagtagcmccttc 
50 tcgttggacagcttcctgccaggcggggagtaccaggagctgctttgggg ■ 
tcctcgacgamccccacgtctgccagagmgamcagcagtgcagcttcmgggcmgga^^^ 
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TGTCGTCACGTCGAAGTTCCCGTTCCTGGGTGTCGCGCTGTGTCAAMCTACATC 
CCTGTTACAGTTTTGATGTAGTTCTAGGAGGACGGCGAGTCGCCGTC 
ATGTGTACCTACATCMCATGGGTGGACACCGTGTCGTCGGAAGTC^ 
TGGCMGGGACGAGAAGGGGMTGTCCTCCTGGMGATTCTTGMGTGGGACCGTTCCCTC 
5 TCTAGGCMGGGCCGTTGTCCCTTCGACCCGMTTTCMGTCCACTGCCCTGGTCCGTTC^ 
MGTTCAGGTGACGGGACCAGGTOATGGCGAGCTCTACACTGGAACAGTCAGCAGC 
AGATGTGACCTTGTCAGTCGTCGMGGTTCCCTTACACCCGGCCATCTCGCGGAGCCAAAGCCT 
GCTGGGCCGGTAGAGCGCCTCGGTTTCGGMGCGGGGTGGTTCTGGCTOTC 

GGCCTCAGCCTACATTCCAGGGAGTTGACCGACGTTCTGGGTCGAAAACACCGGAGTCGGATGTAAGGTGAGAGCCTGGGCAGC 
10 TTGCMGGCGATGATGACMGATCTACTTTTTCTACTCTCG 

TCAGCGAGACTGGCCAGGMTTTGAGTTCTTTGAGMCACCATTGTGTCCAG 

CTTGTGGTMCACAGGCGCATTGCCCGCATCTGCMGGGCGATGAGGGTGGAGAGCGGGTGCTACAGCGTMC 
TTCCCGCTACTCCCACCTCTCGCCCACGATGTGCAGCGCTGGACCTCCTTCCTC 
TCGCGACCTGGAGGMGGAGTTCCGGGTCGACGACACGAGTGCCGGGCACGATGGCTTC^ 
15 TCACGCTGAGCCCCTGCTACCGAAGGGGAAGTTGCACGACGTCCTACAGAAGTGCGACTCGGGGAGCCCCCAGGACTGGCGTGA 
. CACCCTTTTCTATGGGGTCTTCACTTCCCATCGGGGGTCCTGACCGCACTGTGGGAA^ 
CACAGGGGAACTACAGMGGCTCTGCCGTCTGTGTCTTCACMTGACACCGTGTCCCCTTGATGTC 
AGMGTGTTACTAGGATGTGCAGAGAGTCTTCAGCGGCCTCTACMGGAGGTGMCCGTGAGTC 

GCCGGAGATGTTCCTCCACTTGGCACTCACACAGCAGTGGTACACCGTGACCCACCCGGTGCCCACACCCCGGCCTGGTGTGTC 
20 GTCACCATGTGGCACTGGGTGGGCCACGGGTGTGGGGCCGGACCAGCGTGCAra 

TCATCCCTGCTCGCACGTAGTGGTTGTCACGGGCCCTTTCCTTCTAGTTGAGTAGGGACGAGCTCCCAGACCGCGTC 
TCCTCMGGACCACTTCCTGATGGACTCGAGGGTCTGGCGCACGACTTGMGGAG 

CCGMGCCGCATGCTGCTGCTGCAGCCCCAGGCTCGCTACCACCCGTCCAGGCTTCGGCGTACGACGACGACGTCGGGGTCCGA 
GCGATGGTGCGCGTGGCTGTACACCGCGTCCCTGGCCTGCACCACACCTACGATGTCCCGCGCACCGACATGTGGCGCAGGGAC 

25 CGGACGTGGTGTGGATGCTACAGGTCTTCCTGGGCACTGGTGACGGOT 

GTGACCACTGCCGGCCGAGGTGTTCCGTCACTCGCACCCGCCCCGGGTGCACATCATTGAGGAGCTGCAGATCTTCTCATCGGG 
ACAGCCGGGGCCCACGTGTAGTAACTCCTCGACGTCTAGAAGAGTAGCCCTGTCGGCGTGCAGAATCTGCTCCTGGACACCCAC 
AGGGGGCTGCTGTATGCGGCCTGCACGTCTTAGACGAGGACCTGTGGGTGTCCGCCGACGACATACGCCGGACACACTCGGGCG 
TAGTCCAGGTGCCCATGGCCMCTGCAGCCTGTACCGGGTGTGAGCCCGCATCAGGTCCACGGGTACCGGTTGACGTCGGACAT 

30 GGCCAGCTGTGGGGACTGCCTCCTCGCCCGGGACCCCTACTGTGCTTGGAGCGGTCGACACCCCTGACGGAGGAGCGGGCCCTC 
GGGATGACACGAACCTCGCCCTCCAGCTGTAAGCACGTCAGCCTCTACCAGCCTCAGCTGGCCACCAGGCGAGGTCGACATTCG 
TGCAGTCGGAGATGGTCGGAGTCGACCGGTGGTCCGCGTGGATCCAGGACATCGAGGGAGCCAGCGCCAAGGACCTTTGCAGCG 
CGGCACCTAGGTCCTGTAGCTCCCTCGGTCGCGGTTCCTGGAAACG 

AGGGGAGMGCCATGTGMGMGCCMCACAGGGGCAGAAMCATGGmTCCCCTCTTCGGTACACTGCMGTCC 

35 CCCMCACAGTGMCACTTTGGCCTGCCCGCTCCCGTTCAGGTC 

TCTCCMCCTGGCGACCCGACTCTGGCTACGCMCGGGGCCCCCGTCMTAGAGGTTGGACCGCTGGGCTGAG 
GCCCCGGGGGCAGTTAGCCTCGGCCTCCTGCCACGTGCTACCCACTGGGGACCTGCTGCTGGTGGGCGGAGCCGGAGGACGGTG 
CACGATGGGTGACCCCTGGACGACGACCACCCCACCCMCAGCTGGGGGAGTTCCAGTGCTGGTCACTAGAGGAGGGCTTCCGT 
GGGTTGTCGACCCCCTCAAGGTCACGACCAGTGATCTCCTCCCGAAGGAGCAGCTGGTAGCCAGCTACTGCCCAGAGGTGGTGG 

40 AGGACGGGGTGGCATCGTCGACCATCGGTCGATGACGGGTCTCCACCACCTCCTGCCCCACCGTGACCAAAC^ 
CAGTGTACCCGTCATTATCAGCACATCGCGCTGGTTTGTCTACTCCCACCGTCACATGGGCAGTAATAGTCG 
AGTGCACCAGCTGGTGGCMGGCCAGCTGGGGTGCAGACAGGTCCTACACTCACGTGGTCGACCACCGTTCCGGra 
GTCTGTCCAGGMCTGGMGGAGTTCCTGGTGATGTGCACGCTCTTTGTC 
CACGTGCGAGAMCACGACCGGCACGACCTCCCAGTTTTATTCTTGCTCTACCGGCACCGGM 

45 CAAMTMGMCGAGATGGCCGTGGCCTTGTCGTACTTTCAGAACCTGMGCAGGGGGMTGTO 
TGCCCTGTGGGGACTTCGTCCCCCTTACACGGTCGCACGTGGGGTTCTGGACGGGACAC 
TCMCGGCCTAGGGCCCCCTAGCACCACGACGGGGGACTCTGGGCGGGTGAGTTGCCGGATC 

TCACCGAGGGTACCAGTCCCTGTCAGACAGCCCCCCGGGGGCGGTGAGCTAGTGGCTCCCATGGTCAGGGACAGTCTGTCGGGG 
GGCCCCCGCCGAGTCTTCACTGAGTCAGAGMGAGGCCACTCAGCATCCMGACAGCTGGCTCAGMGTGACTCAGTCTCOT 
50 . CCGGTGAGTCGTAGGTTCTGTCGATCGTGGAGGTATCCCCAGTGTC^ 

TAGGGGTCACACGGGGGCCGGGGCCCAGGCGGAACCGAGCGAGATCCGTGACTCTGTGGTGTGAGAG^ 
CTGCCCCTCTAGGCACTGAGACACCACACTCTCGACTGAAGGTCTCCTGCGACGGGTGGCTTCAGGGGCTGTC 
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agggtcmctggacctcccctcaccgaagtccccgacacttacgagcctctcccagttgacctgg 
cgtggmcacgaccgtggtgcccggcccttgggagcctgcgagacgagmgcacc^ 
cggatggagccagctggcctgctgctctccagtcmgtagcgmgctcctagcaacctcggtcgaccggacgacgag 
ttcatcgcttcgaggatggtcccagacacccamcagccgtggccccagaggtcctggccamtatgggggggtctc 
5 gtcggcaccggggtctccaggaccggtttataccccgcctgcctaggto 

tgcggacggatccmccaccttgtcacgaggmtacattoactcgggamctttag 
* tgagagggmgagatagcamtcttttgttmggtttacactttc 
acggctgctccagttcatggcctcccaggggtgctaccgtacgtcgtc 

TGGGGATGCATCCAAAGTGGTTGTCTGAGACAGAGTTGGAMCCCTCACCACCCCTACGTAGG 

10 cmcctttgggagtggmctggcctcttcaccttccacatta^ 

ggtgtmtagggcgacggtggccgacgggacactcactxxagattcaggaccagcttg 
gtgacgtctaagtcctggtcgaac^ 

accacctcagggaccagtcggctcctacatcaacmcgacggcagcagggtggtggagtccctgcagagggctaggttgg 
gcggccctcaccaggtcctgggctcggaccgtctcccgatccmccgtgacgccgggagtggtccaggacccgagcctggcmc 
15 tcctggacctttccagcctgtatcaggctgtggccacacgagaggagttgaggacctgg^ 
gtgtgctctcctcagcgcgagctcaggagagatttcgtgacmtgtacg 
agcactgttacatgcggaaagggagtctattcagggmgagactgtcg 

ccttctctgacagcggacggaaggaggcmcaacgcactcttggcgtgtgccccttcccaccatatccaccctcgctccatctt 
tgaactcaaagcaovcggggaagc&t^ 

20 TCCTCTCCCCAGTCCCCAGTTCACCCGTGCTCCTTGATTGACGTGGGACCAGGAGAGGGGTCAGGGGTCM 

TCACCTTCCTCCACTCTAAGGGATATCAACACTGCCCAGCACAGGTAGGGAGTGGMGGAGGTGAGATTCCCTATAGTTGTGAC 

GGGTCGTGAGGGGCCCTGMTTTATGTGGTTTTTATATATm 

TATATAAAAAATTATTCTACGTGATTATGTCATTT^ 

AAAATTATTTCAGACTTCTTMTGAGAMTTTTT^ 

25 93 

MVGYDPKPDGRMTKFQVAVAGSVSGLWRALISP 
AQILSIGYGAVQFLSFE1%TELVHRGSVYDAREFSWFVC^ 
EGPQVFYKGLAPTLIAIFPYAGLQFSCYSSLKHLYKWAIPAEGKKNEN^ 
RAAFGQVRRYKGLMDCMQVI^KEGALGFFKGLSPSLLKMLSTGFMFFSYEFFCNVF^ 

30 94 

ATGGTTGGCTATGACCCCAMCCAGATGGCAGGMTMCACCMGTTCC 
. ACTCGGGCGCTGATCAGTCCCTTCGACGTCATCMGATCCGTTTCCAGCTTCAGCATGAGCGCCTGTC 

GCAAAGTACCATGGCATCCTCCAGGCCTCTAGGCAGATO 

GCTCAGATTCTCTCCATAGGCTATGGAGCTGTCCM 
35 TACGACGCCCGGGMTTCTCAGTGCACTTTGTATGTGG 

GTTCTGCGCACCCGCTTTGCAGCTCAGGGTGAGCCCMGGTCTATMTACGCTGCGCCACGCCGTGG 

GMGGCCCCCAGGTTTTCTACAMGGCTTGGCTCCCACCTTGATCGCCATCTTCCCCTACGCC 

AGCTCCTTGMGCACCTGTACMGTGGGCCATACCAGCCGMGGAMGAAAMTGAGM 

GGAGCTGGTGTCATCAGCAAGACCCTGACATATCCGCTGGACCTCTTCMGMGC^ 
40 AGAGCTGCCTTTGGCCAGGTACGGAGATACMGGGCCTCATGGACTGTGCCAAGCAGGTGCTACAAMGGMGGCGCCCTGGG^ 

TTCTTCMGGGCCTGTCCCCCAGCTTGCTGMGGCTGCCCTCTCCACAGGCTTCATG 
. GTCTTCCACTGCATGAACAGGACAGCCAGCCAGCGCTGA 

95 

MELALRRSPVPRWLkLLPLLLGLNAGAVIDWPTEEGKEWro^ 
45 FGNFEEIGPLDSDLKPRKTTWLQMSLLFVDNPV^ 
GGKMAAGIGLELYKAIQRGTIKCNFAGVALGD^ 

WGKAEMIIEQOTI>3VNFYNILTKSTPTSTMESSLEFTQSHLVCLCQRHVRHI^RDALS 

VFVNMEEDFMKPVISIVDELLEAGINVTVYNGQLDLI^^ 

LAFYWILKAGHMVPSDQGDMALKMMRLVTQQE 
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96 

CCTGTTGCTGATGCTGCCGTGCGGTACT 
CGCTGCTGCTGGGCCTGMCGCAGGAGCTGTCATTGACTC^ 

gcaaggatgcctacatgttctggtggctctattatgccaccmctcctc 
5 ttcagggcggtccaggcggttctagcactggatttggamct™ 

AMCCACCTGGCTCCAGGCTGCCAGTCTCCTATTOTGGATMTO 
CCTATGCCAAGGACCTGGCTATGGTGGCTTCAGACATGATG^^ 
CAGTTCCATTCTACATTTTCTCAGAGTCCTATGGAGGAAAMTGG 
GAGGGACCATCMGTGCAACTTTGCGGGGGTTGCCTTGGGTGATTCCTGGATCTCCCCCGTG 
10 CmCCTGTACAGCATGTCTCTTCT 

ATMGGGGCTCTACAGAGAGGCCACAGAGCTGTGGGGGAMGCAGAMTGATCATTGMCAGMCACAGATC 

ATMCATCTTMCTAAMGCACTCCCACGTCTACMTGGAGTCGAGTCTAGMTTCACACAGAGCCACCTAGTTO 

AGCGCCACGTGAGACACCTACMCGAGATGCCTTMGCCAGCTCATGMm 

AGGATCMTCCTGGGGAGGCCAGGCTACCMCGTCTTTGTGMCATGGAGGAGGACTTCATGMGCCAGTC 
15 ACGAGTTGCTGGAGGCAGGGATCMCGTGACGGTGTATMTGGACA 
GGGTGCGGAAACTGAAGTGGCCAGMCTGCCTAMTTCAGTCAGC 
AMCATCTGCTTTTGTCMGTCCTACMGMCCTTGCTTTCTACTGGATTCTGA 

gggacatggctctgmgatgatgagactggtgactcagcmgmtaggatggatgggg 
ggcacagagctgagctgaggccgctgmgctgtaggmgcgccattcotccc 
20 ctgaccagcttctgcagaggataamtcattgtctctggaggcmtto 
ttttaaaamttgattottttgatcaamtamggatgatmtagata™ 

TTATAGAAAAACTGGGAAATACAGGTACCCAAAGAGTAAATCAACATCTGTATAC 
ATTTAGCATATGTCCTTGCAGMTTTTm 

TTACTATTTGTTTGACATATCAGTATATCTGAMCACCTTTTCATGTCMTAAATC 
25 98 

AATAGTTCATTMTTAGTCTTTATTATC 

TAAMTTTACAMAGMTAAMGCATTAAMTAMG 

CTCAAAAATCCGAAAGTGATTTTATCMTGTCOT^ 

TGCCGGGCTGGGGGCGGGAGCTACGGCTTCTCTGGGGACGCGGAAGCGAGAAGCAGGGACCTTGG^ 
30 GACGGTGCCCCGGCCCCCAGCCCACGCCTCCGGCCGCTGGCATGGTGCTGCTTGCTGGGACCCGGCCGCAGGG 

GCTGCATGACCCCGCCACCGCCGTCCCCACTCCTAGGCGCACAGGTCGAGGAGGACCGCGCTGACTACAAAGAGTTCCAGGGCC 
CCACACCTGCAGCATAGCCTGGGACGACTCTGTACCCTTTCCAGGACGAGGAGGAGCACGGCGTCGAGGGCTTGGAGAGCGTCC 
CGGAGGAGGGCGTCCAGGAGGCATGGGGCTGCTGCGGACGCTGGTGCGGCGAGGGGTGAGCGTTGAGAAGGCGCAGGAGACTGA 
CCACAATGGCTGGACCAGCCTTAATGTCGCCTGCTACCACGGCTTTGTGGATATACCGTGGTGGCCTTAGCTGAGTAACCCCAC 
35 ACTGACGTCAACTGGCAGGACAGGGAGGGGAACGCAGCCCTAAATACAGCTGCACAGGCAGGAGCTGCCTCTGGCCCCATGG^ 
ACCGTTGCCTTGGACACTGGCTCTCAGCCTTGTGCTTCCTGGTCAGCATC 
CTGGTGGAAACACAGGCTGCCAGGCCCTGCCCCTGGAGTCTTTCTTTGMTA 
MCAGGTTTCCTGGTGAGGCTGATGCCGCTGGCCCAGGGTTCTCACGTTGAGGA^ 

ATGTCCAGGMCCCTTCTCAGTAGGCACTGMGACCATCAGCAGMTCACAGACCCCAGGAGAGATGTCGTCAGACAGACACAG 
40 AGGCATCACCGMTTAMGTGAAAATGMGAMGGAGCTGAGCATCTGTTTCATGACTTO 

ATCCTGGCCCAGTCAGGGCACGCTATCATCACCMCTACTTGTTGMCTACGTTC 

TCGGGTTGAAAGCCGCCATGCAGGGTCGAACGGATCGCATCGGAGCCCTGAGCT^^ 

CGCCGTGGGATGTCGCCAGAGGAGTGGCCACTTATACGTC03CCTCATGCAGACX5CTCCTGGAGCGCCC 

GGGGAAAMGTACCAGCTTGAGCTGCCGCCGCTCCCCGAGAGGCCGCAAMGCCCG 
45 CAGGGACTGCGAGCTGTCCGCGCTGACGCCGCGCTCCATGCGGGGCCCAGAGGACCGGGGC^ 

GACCACGAGCCTCTMCAGCCCCGCCGGGGCCCTCGCGTGCCAGACTATGTGCCCCGAGAGCACCTTGTGCTGG 

99 

MAPMGIRLSPLGVAVFCLLGLGVLYHLYSGFLAGRFSLFGl£^ 

RESimiHEKSKGKTREGAEDKMTSGDVLSNRKMFYLLKTAFPSVQINTEEHVDAADQEVI^ 
50 VTVWIDPLDATQEYTEDLRKYVTTMVCVA 
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ALQTFGNQTTI I PAGGAGYKVLALLDVPDKSQEKADLYIHVTY I KKWDICAGNAILKALGGHMTTLSGEEI SYTGSDGIEGGLL 
ASIRMNHQALVRKLPDLEKTGHK 

100 

cacctcgcggccgtaggctmcgtggmgtcggaccagccggccggcggaagaacctagagcgcgctgcctggcgagtc 
5 cgcggggcggcgttggtggtcttcgcggcgtmctcggcctttcctgggagggagtgatggggcgc 
gcgccagtgtagcccgcgcggcgcctggcccggagcgcggcggctgcggcggcggcggcggcox^^ 
ccgcagcccggagcgcccggcttcccacgccatggcccccatgggcatccgcctttccccac^^ 
gctggggctcggcgtgctctaccacctctactcgggcttcttggccggccgcttcagcctcttc 
cggcggcgcgggggggcccgcggccgcggccgatgggggcaccgtggacttgcgcgagatgctggctgtgtcag 

10 agtccgcggcggcgacgaggtgaggcgcgtccgcgagagcaacgtcctccacgagaagtccaaggggaagacgcgcgagggagc 
cgaggacmgatgaccagcggcgacgtgctgtccmccgcmgatgttctacctgctcmgaccgc 
tmtactgaggmcacgtggatgcagctgatcaggaggttatcttgtgggatcatmgattcctgagga 
aactactcctamgaggtaccagcagamgtgttactgtctggattgacccacttc ■ 
tcgamgtacgtcactactatggtgtgtgtggctgtamtggtamcccatgctaggagttatacatmgccattttccgaata 

15 tacagcttgggcmtggtagatggtggttcamtgtgam^ 

tcgttcccattcagggatggtcamcaggtcgctcttcagacttttggam 
ttatamgttttagcacttttggatgtgcctgatmgagtc 
gtgggatatatgtgctggtmtgccatcttaamgccctaggggg 
tggttcagacggcattgmgggggactccttgctagcatcagmtgmccacca 

20 amgacaggacataaatgagcatmctgattacagggtacagttcto 
camggattggtc&agactatgcatggttmggccatccc^ 

tgtmtagmtgcaaaatcagggaamtgggttgctttgtgtctcmgtattgtcttta • 
gtcatacacmggcgcatatatatatttgtgmttaamtctgtagctgagtctaaaaaaaaaa 

102 

25 TTTTTTTTTATTTGGATTCTATTTAm 

TTTTMCTGTOGCAGGTAGTTTGTGGCAMGATGGCTAAATMTGAAGCAAATTAGM 

TTTTTTCTGTTGAGACTATCATTATTTGTCTTATTACCCAAGAGGCM 

AGGAATAGTTCTGTAMTACATTTAMTAMCTGTAMGATATTT^ 

GGTTTGMTAGTAACTAAATGAGACACCAGCCCTTGACATTGAGTTTG^ 
30 GTCATTCTAGCACTGATTATATGCTGCTACTTTAACTGGCTCCAGCTGCTTCACTACATC 

CAAMTGGACGGACMTTAMTGTAMTTATAGAACT^ 

TACAAAAA 

103 

MPIVDFHLQKLPAAAMPFLDKSHSSSPGSPKTAELFF^ 
35 YHSRGVLLKASERASGGSKAVQPGRKEKETPRHRKPRFSRRKNR 

104 

MTATTGCAGAGCMCTTGAMGGG05AGCTGGCCGCGTAGGAGGTGAGAGCCAGCCGCAAAM 

CTTCAMCCCGCCCGGTCGGATGTGGCATTTGMGAMGMGGCGTGGGTCAATMTCAACC 

TGACGCGACTCTCACCGCCTTCTCAGCTMGCCAGCTC 
40 TCTGGGCCAAGGGGACCGAGGGCTCCCGAGCAAGGGCTGGGGCGCCCTGAMGCATACCTTTCCACTGGCGCCMGA^ . 

GGGCAAMCGCGGGGACCAGACCTGAGGAGCCGTGGGAGCCCAGGGAGATCTGGGGCGGGAGCTTCGTTCC 

ATCCGCCCCGATACGTGGCTCCTGGGCCGCGGGCCTCAGCGGGCAGMGTGGGGC^^ 

TGGCATGCAGATGGTCGGTGCCCCAGCCTCMGCCGCTCTGCTTAAATCCAMGGAC^ 

TATTTTAAMTMTAATTTATGCTGGCGGMGGTTTTTAT 
45 GATTGTTGATTTCCACCTCCAGAMmCCAGCC^ 

CCCCAAMCCGCAGAGCTTTTCTTTACTGAGAGGACCCCTTGG 

ATCCAGGGGGACTAGAAGAAMGATTTTCTCAGCACTAGAAGMTACTTGG 

TTCMGGGGTGTCCTCCTGMGGCGTCTGAGAGGGCCTC 
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GCCMGACATAGAAMCCACGCTTTTCC^ 
CGAAGGCCTAMGGTTAGAMTACAGCMTTTTTCTCATGCCGM 
ACMGTTTTCCTCAACAGCGAGCTGGGACTGGCGACAATTTC^^ 
CAGCCCACGTAGCGGCCGCCGCGCCCGTGCCCGTGGCCTTC^ 
5 CTTGCAGGAAGAMCAAACACACTCCCTCTGACTC^ 

GTTTTGGGGTAGGCTCTTTGTCTGCCTGAGGCCCCGCACTCGCAGG^ 
MGTTAMTATTGACCAGMCCTCTTTCAACAGGCCGCGGTTGCCAGGMCCGTGGAGGG 
GCAMGTCCCACGCGGCGMGAGTTTTGGAGCAGCGCTTACCTAGAMGATGTTTAMTTCTG 
CCAGGCGCTCAGGGMTCGCCTTTTCCGGTGTCCAG^ 
• i 0 GMGGGAGTTGGAGGCCGGGAGTGGGMGCGAGGCTCAT^ 

CACCAGGGCACTCGTCCGTGTGGCGCACTCATTGCTCGAMTGTTCCAGTOTACAAATGGTCCG 
TAAAATAMTACATACTCCATCTCCACACCC 

106 

TTTTTTTTCCTAGTTTCATTAAAGATGGAG 
15 CACCCATTMGTTAAAAACTGGAATTTTACCTTTCCAAACTTATTTC 
TCAGTMTTCTCTTCMGCTCCATMTTTCCT^^ 
GTCTATTTCTATGTTCAMTTACACCTGTAAGATC^ 
ACAGGAGGTMTTTTATCTNCACCGGATACCATTTCCTCCTTGGACAOT 
GNG 

20 108 

GCTTACCTGGAAMCTGTMCMTCTCCAGGTTGACCTTTCTGTCTCCCTCTAMCACATCACATTG 
TTAATAAGGATTATATTACATATAAAA^ 

GCTTCATGATATATTMCCTTGTTGACCTCTTTGGTCTCTCTCAGTACTTCTCMTTCATAC 
TTAGTTTTAGTTCCCTGAGTATGTTATCAGCTCTCTGGATTCTGGGCT^ 

25 ttcatgtctttcttgccacaccccccaccccto 
. cagtttamtggcatttctttgggmgcatttcctgattctctacctcccaggctggg™ 
tgtagtacccttttcatggcatttacttaccatagcctatttc 

TGTTTTAAAGTCAGCACCTAAATCTCCCTGGCC 
110 

30 TCTCTACAMTMCTATMCAAMCTTTTTAGGAMGTACCACAMTATGTTAAACAG 
AMGCTGTGAGTTTCTTCTTACATAAAACTG&TACCT^ 
ATTATACAGTMCACCATACAGCCAAAACGGCC^^ 
TAGGGGAAMGAMCCTCTACGATAAACTCAMTGCCACCATCAGGGTT^ 
TAGCTGCTAATGAGTTTCTATGGTTCCAGNTGGGGTAGTTCAAAGTA 

35 112 

TAAAMGCATTAGGCATATAMTGTATAMTATATTTT^ 

TACAGGCTAGTATTTCAGCACAGACTTO 

GCCACCCMCCCCCTGCACACACCACCGGCCACCAGG 

GGTGGTGATCACAGCTC.CTAGCATMTGAGAGTTC^^ 
40 AGCGATGGCCCTGTTGATTTCACCCTGCCTTTTACTGMTCTGTAM 

TTGCCTTITCTAMCTGCTACATGTTTATAATCTTC 

GTCTCCTTGTCAGTGAGCCAGACTTGOT 

GCATGCMGMCTTTCCAGTAGCAGTGMGGAGGGCTGCCTCTC 

GCAGAGGCACTAGTAGCATGGGGGCTAGAGTGGGGAGCGAGATGTAAMGGGT^ 
45 GCATTAGGGTCCTGAGAACTTCTGAGTTCAGAGAMCATGC 

ACCCTGGGAGCTGCTTTGGGAGTGAGAAAGGCMCCCTCCMTGTGTTTCAA 

ATCTCATTTATTCTCCTTTTCTAGCGTTTGTTGM 

ATCTGAMCAGCTGAGGMGGGACAGAGMGGTACMGGGQAAGGCAGCACAAMCAG^ 
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TGGTTTTTTGTTTTGTTTTCT 

GGGTAMGTCMGTGCAGCATCAGTACTGCGAGACCCACCAGCCCCTGGAGAGGGTC 
CTAGGGTCCCGGCACCCTCACCCTCAGCCACCTGCAGAGAGGCCAGGGCCC^ 
TGCTGCCAGAGCCCTCTGCCCCTTATGTTGAGACCCTGCTTTCAGGACAGGCC^ 
5 AGTGTCACAGGTCCCTMCMTMTTTTCTGATCTGGAG^ 
GTTTGATTTATGATGAGMCTATCCGAGGCCACCCTTGGC^ 
MTTAGAAGAGTACCTMTGTTGAAAACATGACATGCGCTCTTGGGATCTGCTGT^ 
GTTACCAMGCTAGGAMGAGCTTTATCACMGCCTTCACTGTCCTGGCATGAGMCTC 
MCTGTGAATGAATCTGAGCTTGGTTTCCTTTATTGCTTCCTCTGCAATATGATTG 
1 0 TTGTC ACTCCTGTTMTGGGAGGATCAGTCACACATGTGTAGTACMGGCGGACTTOTC 
CATTGTGTGTGTTGCTTCCCCACCCTGAGGAG 

TTCAGGTGGCTTCCAMCTTGTACGCAGTTTAAAGATGGTGGGGACAGACTTTGCCTC 
MGGAGCATTTGMTCTCTTGGAAMGGCCATGMGMTAMGCAGTCAAAMGMGTCCTCCATGTTGGTG^ 

gaggggaaataaamtgttatccagcctgaccmcatggagamccccgtctccattaaamtacaamttagc 
15 ggcgcatgcctgtmtcccagctactctggaggctgaggcaggagaatcgcttgmcccaggaggcgga 
agatcatgccagtgcactccagcctgggtmcmgagtgamctccgtc^ 

CTGAMGCAAAMGGAMCCCTMCAGCTCTGMCTCTGGTTTTATT^ 
CATAATGTTAAAAAAAAAAAATTTTTTTTTGGTA^ 

ACCGGGGAGGCAGAGTACTACTAGAGGMGTGMGTTCTGATGGMTCATGCCTGTCAMTG 
20 ATAAMGAGTATATTTTATCTAAATCTTMGTC^ 

TGTATCTGTTTGTATTTTTCTTTGATGMTGATTGGTCATGAGGCCTCTTG 

AAGAACTATGTGTCTGGTCACTTATTTCTCTAAAATTATCTCATTGC 

CATTATGTACATGGGAMTGTAMCAMTGTGMGGAGGACCAGAA 

GGCTTCACATGTTTMCTTTTTTTMGAAAAMGTTGCATGMT^ 
25 TATCTGTTTCMTTCCTTGCTCATATCCCATATMTCTAGMCT^ 

GCAGTTGAGACTTTGATTTGMGCACCTCATCCTOT 

MCCATATGTTGCCATGMTTMCTCTGCCGCCTTTCTTMGGATCAAMCCAGTTTG 

GAMTAGAGATGCAGTACTTMCTTTCCTTGGTC 

AAAAGAGATGGTGACGCATGTAAATAAAGCATCAGTGACACTCT 

30 113 

MGTDSRMKALLARARTLHLQTGNLLW^ 
EREEMKVSAKLFIVESNSSSSTRSAVDMCSVLGVAQ^ 
KTQLEQLYQWAQVKPNSNQVNLASC^ 
I VKSRGI I KSKGYI LQAKRRGS 

35 114 

GGCACGAGGCTGCGGCCGCAGTAGCCGGAGCCGGAGCCGCAGCCACCGGTGCC 

GGCCTCCCTCGGGCCCGAGCGCAGACCAGGCTCCAGOT 

CTCGGGCACCGCGTCCTGTC&GCGGCCGCCTGCCTGC 

CATGGGCACCGACAGCCGCGCGGCCMGGCGCTCCTGGCGCGGGCCCGCACCCTGCAC^ 
40 GGGCCGCCTGCGGMGAAGTGCCCGTCCACGCACAGCGAGGAGCTTCATGATTGTATC 

CCAAATCAACCCAGATTTCK3TCAGGGAGTTTCCAGAT 

TGAMGAGMGAMTGAMGTTTCTGCAAMCTGTTCATTGTAGMTC 

GGCCTGTTCAGTCCTTGGAGTTGCACAGCTGGATTCTGTGATCATTGCTTCAC 

GGAGCATTTACAGCCTTACTGGGAGGMTTAGAAMCTTAGTTCAGAGCAAAAAGATTG 
45 CAAMCACAGTTGGAACAGCTGTATCAGTGGGCACAGGTAAMCCAAATAGTMCCMGTTMTCTTGCCTCC 

GCCACCAGATTTGACIWATTTGCTAMCMTTTGACATACAGC 

MGTTTCCAAGMGCTCTTCAGGAMGCATTCCTGACATTCAAGCGCACGAGTG 

CATTGTGAAMGTAGAGGMTOTCAMTCAAMGGCTACATTTTACMGCT 

TAACTTACCTGTMTTTCCTTCMTATGAGAGAAMTTGAGATGTGTAAM 
50 CAGATATTCTTTCGTCATATTTGACAGTATGTTC^ 
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gmcctctgtattgctttccttamctattc 

ATTTTTATGACTAATATAMTCMGAAMTTTC 
TTATTTCCAAAAAAAATTCATTMTTGCTTCTTAACTCTTATTA 
TGGMGMGATATTTTTCCTGTCATTGATGAGATATATCAG 
5 TCACTCTTCAATAC 

115 

MTAI IKEIVSRNKRRYQEDGFDLDLTYI YPNI IMGFPAERLEGVYRNNIDDVVRFLDSKHKNHYKIYNLCAERHYDTAKFNCR 
VAQYPFEDHNPPQLELIKPFCEDLDQWLSEDDNHVAM 
PSQRRYVYYYSYLLKNHLDYRWALLFHKM^ 
10 IKVEFFHKQNKMLKKDKMFHFWW 

FSPNFKVKLYFTKTVEEPSNPEASSSTSVTPDVSDNEPDHYRYSDTTDSDPENEPFDEDQHTQITKV 

116 

CCTCCCCTCGCCCGGCGCGGTCCCGTCCGCCTCTCGCT^ 
GCGGCGGCGGAGGGGGCGGGCAGGCCGGCGGGCGGTGATGTGGCA^ 
" 1 5 TGGGACGCGACTGCGCTCAGTTCTCTCCTCTCG^ 

CCGGGCTCAGGCGAGGGAGATGAGAGACGGCGGCGGCCGCGGCCCGGAGCCCCTCTCAGCGCCTGTGAGCAGCCGCGGGGGCAG 

CGCCCTCGGGGAGCCGGCCGGCCTGCGGCGGCGGCAGCGGCGGCGTTTCTCGCCTCCTCTTCGTCTTT^ 

CTTCCTCGGCTTCTCCTGAMGGGAAGGTGGMGCCGTG&GCTC^ 

ACCTCCCGCTCCTGGAGCGGGGGGGAGAAGCGGCGGCGGCGGCGGCCGCGGCGGCTGCAGCTCCAGGGAGGGGGTCTGAGTCGC 
20 CTCTCACCATTTCCAGGGCTGGGMCGCCGGAGAGTTGGTCTCTCCCCTTCTACTGCCTCCAACACGG 

ACATCCAGGGACCCGGGCCGGTTTTAMCCTCCCGTCCGCCGCCGCCGCACCCCCCGTGGCCCGGGCTCCGGAGGCCGCCGGCG 
GAGGCAGCCGTTCGGAGGATTATTCGTCTTCTCCCCATTC 

CCCAGTCGCTGCAACCATCCAGCAGCCGCCGCAGCAGCCATTACCCGGCTGCGGTCCAGAGCCAAGCGGCGGCAGAGCGAGGGG 
CATCAGCTACCGCCMGTCCAGAGCCATTTCCATCCTGCAGAAGAAGCCCCGCCACCAGCAGCTTCTGCCATCTCTCTCCTCCT 
25 TTTTCTTCAGCCACAGGCTCCCAGACATGACAGCCATCATCAMGAGATCGTTAGCAGAMCAAAAGGAGATATCAAGAGGATG 
GATTCGACTTAGACTTGACCTATATTTAT^^ 

ACAATATTGATGATGTAGTAAGGTTTTTGGATTCAMGGATAAAMCCATTACMGATAT^ 

ATGACACCGCCAAATTTAATTGCAGAGTTGCACMTATCCTTTTGMGACCATMCCCACCACAGCT^ 

TTTGTGMGATCTTGACCAATGGCTMGTGMGATGACMTCATGTTGCAGCMTTCACTGT 
30 GTGTMTGATATGTGCATATWATTACATCGGGGCAMTTTTTAMGGCACM 

CCAGAGACAAAMGGGAGTMCTATTCCCAGTCAGAGGCGCTATGTGT^^ 

ATAGACCAGTGGCACTGTTGTTTCACMGATGATC^ 

TGGTCTGCCAGCTAMG&TGMGATATATTO 

CTCAGCCGTTACCTGTGTGTGGTGATATCAMGTAGAGTTCTTC 
35 ACTTTTGGGTAAATACATTCTTCATACCAGGACCAGAGGAMCCT^ 

TCGATAGCATTTGCAGTATAGAGCGTGCAGATMTGACMG^ 

CAMTAAAGACAAAGCCMCCGATACTTTTCTCCAMTTTTMGGTGMGCTGTACTTCACAAA 

ATCCAGAGGCTAGCAGTTCMCTTCTGTMCA^ 

ACTCTGATCCAGAGAATGMCCTTTTGATGMGATCAGCATACACA 
40 AAMCACCATGAAAATAAACTTGMTAMCTGAAAATGGACC^^ 

. GTTATAGGMCMTTCTCTTTTCCTGACCMTCTTGTTTTACCCTATACATC . 

AAAAGGTTGTGTAGCTGTGTCATGTATATACCTTTTTC 

CTTTCCCGTTTTATTCCAGTTTTATAAAMGTGGAGAC^ 

CCMCTGMGTGGCTAMGAGCTTTGTGATATACTGGTTC 
45 TGGCTMGAGAGGTTTCCGAMGGTTTTGCTA^ 

GGAMTAATTTTATGCTGGACTCTGGACCATATACCATCTCC^ 

ATCTGGACATTCGAGGMTTGGCCGCTGTCACTGCTTGTTGTTO 

GCTAAAGGMGTGMTCTGTATTGGGGTACAGGAATGMCCTTCTGCAAC 

MTGTCATAGGTMGAMCACAGCMCMTGACTTMCCATATAAATGTGGAGGCTATCM 
50 TAAAMlTCACAATGATTTATTAMTATGTTTTCTCAAmT 
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118 . . 

ttctatcagaaattgtat™ 
mttmggcattgttctttatttcaaanttot 

mtactcaccatcaggatatmtamcggaggtttctttotccgaaatccataaantgtag 
5 attccnatttttgcagtnggcttcctc^ 
aggataggagtacngggcatttntggaaaaacccg 

119 

MERVKMIWQRLLEMEFLERRERECEHGYASSFPSMPSPRLQHSKPPRRLSRAQ 
LCLERLKVLIPLGPDCTRHTT^ 
10 RSDSEREEIEVDVESTEFSHGEVDNISTTSISDIDDHSSLPSIGSDEGYSSASVKLSFTS 

120 

agattatgatcgcctgaggcccctctcctacccagataccgatgttatactgatc 
agtaattaagggtagtoamttatotamgtatacam 

agggagtc£ggagaggccccggtcgccacccgcggtgcccatggagcgggtgmgatgatcmcgtc 
15 tgccgagtttttggagcgccgggagcgagagtgtgmcatggctacgcctcttcattcccgtccatc 

gcattcaaagcccccacggaggttgagccgggcacagamcacagcagcgggacgagcmcaccagcactgccmcagatctac 

acacaatgagctggaaaagaatcgacgagctcatctgcgcctttgtttagaacgct^ 

■ ctgcacgcggcacacmcacttggtttgctcmcamgccamggacacatcmgamcttgmgmgctg 

gcaccacktcgagmtttggmcgagaacagagattttt^ 
20 aatacgaatggacagcattggatcaactatttctt^ 

cacagagttctcccatggagmgtggacaatatmgtaccaccagcatcagtgacattgatgaccaca^ 

tgggagtgacgagggttactccagtgccagtgtcamctttcattcacttcatagmcccagcatc^ 

amtattcactgggccaattcmtacamcaatctcttamttgggttcatgat 

aaacaamctatacttgaacaamgggtcagaggacctgtatttaagcamtactt^^ 
25 agmcamtattcagmtattcatattggaaamtcacmtttto 

agttgattgagaagaggacattggagatgccatcctctttctcttttctcgtttgctcatac 

ggatggggttatgmcccttcctgagctttatc^ 

tattmtcttggatagctaatmtgagctattaamctcagcctgggacagtttatcatgmgcctg 

attattattttttttttttgaaaaaagctcatttcatgct 
30 catgcagctcaactttctgttggattccatgctmgcaagctm 

ctctccatcctgctgcccttaggccacatgggagcagtccatgcatgacagccot 

ggggggccaamggaaamgctccatgtgcctctttgtctgcgtgggtcagmgagttc 

ctgagccacagcagcatttttatttcagattttgatmctgtttatatgtg 

tccataaaaaaamtagatgtcttttatagtggaaamcacatggggaaaaamtcatctattttga 
35 ataamcacctcacacctcactctttatagtgcacaaaatgaatgaggtctgggctaggtagaaam 

gtttttagaatcattaccttttaccagcttttaaccatctc 

cagcacttgtctcattttmtgtaaagatttgcttccattttc 

gtgctattgttactcttacgaatattttca 

ttccctttgtctttcaaactccmggttcccctgtggccctctcc 
40 tgtttggactttgtatactttamtmtttm 

121 

. MNSNFITFDIiKMSLLESNLFSATC 

PEAALEDKLARIRENHERALREAKETLQKLPEEIQRDILLEKKKVAQDQLRDKAPFRGLPPVDFVPPIGV^ 
AKIKEMMKHAWNNYKGYAWGliNELKPI SKGGHSSSLFGNIKGATIVDALDTLFIMEMKHEFEEAKSWVEENLDFNVNAEI SVFE 
45 VNIRFVGGLLSAYYLSGEEIFRKKAVELGVKLLPAFHTPSGIPWALLNMSGIGRM 
GNPIFAEKVMNIRTVLNKLEKPQGLYPNYLNPSSGQWGQHHVSVGGLGDSFYEY 
IRKSSSGLTYIAEt^/KRGLLEHKKGHLTCFAGGMFALGADMPEGHAQHYLELGA 

I ATRQNEKYYI LRPEVMETYMYMWRLTHDPKYRKWAWEAVEALENHCRVNGGY SGLRDVYLLHESYDDVQQSFFL AETLKYLYL 
IFSDDDLLPLEHWIFNSEAHLLPILPKDKKEVEIREE 
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122 

CCAACTTATTOAAMCAAAACAATT^ 

ATMCTTGCCAAAGGCCACACAGCTGGTGAGTGATGGAGMCGMTT/^ 

CGTTAGTGTTTCTCACTATCCAGGTCTGCCTTTGCCTTATTO 
5 CAGTTTGGGGGGATCATTACAGCGGGTTTTTM 

CMGGATGACCTCAGGACAGTACAGCAAGGACACAGTGGCACTTCACATTTTGTTCCCACGAAATGACTGG^ 

ATCATCTTCCTTTAGMTGTGGAMCATCAGCAGMGMTATTAGTCTTTATAC 

CTMTAGAGCTGACTTTCAGCCATGATAGCTTTGGCACACCTCACATCCCTTTGTTCAACCTCTCTTC 

CATTCCTGGGAATTTCTGTTGTGCACTTTTCCCACTTGCCC^ 
1 0 CCTTTCCTTCATTTGGMTGMCAGCAATTTTATTACTTTO 

TCATCACGCTCTGCTTCGGGGCGATCTTCTTCCTGCCAGACTCCTCCMGCTGCTCAGCGGGGTCCTGTTC 

CCTTGCAGCCGGCCGCCGACCACAAGCCCGGGCCCGGGGCGCGCGCCGAGGACGCGGCCGAGGGGCGAGCCCGGCGCCGCGAGG 

AGGGGGCACCCGGGGACCCGGAGGCCGCCCTGGAGGACMCTTGGCCAGGATCCGCGAAMCCACGAGCX3GGCTC 

CCMGGAGACCCTGCAGMGCTGCCCGAGGAGATCCAMGAGACATCCTACTGGAGMGMGMGGTGGCCCAGGACCAGC^ 
15 GTGACMGGCGCCGTTCAGAGGCCTGCCCCCGGTGGACTTCGTGCCCC^ 

CCATCCGCGAGAAAAGGGCAMGATCAMGAGATGATGAMCATGCTTGGAATMTO 

MGTCAAACCTATATCAAAAGGAGGCCATTCMGCAGTTTGTTTGGTMCAT^ 

CACTTTTTATTATGGAMTGAAACATGMTTTGM 

AMTTTCTGTCTTTGMGTAMTATACGCTTTC 
20 AGAAAGCAGTGGMCTTGGGGTAAMmCTACCTGCATTTCATACTCCCTCTGGM^ 

GTGGTATTGGMGGMCTGGCCCTGGGCCTCTGGAGGCAGCAGT^ 

acttgagccacttatcaggamccccatctttgctgaamggtaatgaatattcgmcagtactc 

mggcctttatcctmctatctgmtcccagtagtggacagtggg^ 

tctatgagtatttgctgmggcctggttmtgtctgacmgacagatctggmgctmgmgatgta 

25 CTATCGAGACTCATTTGATCCGCMGTCTAGCAGCGGACTAACTTATATCGCAGAGTGGAAAAGGGGCCTCCTGGAGCACAAGA 

TGGGCCACCTGACCTGCTTCGCGGGGGGCATGTTCGCACTCGGGGC 

MCTCGGGGCTGAMTTGCCCGTACTTGTCATGMTCATATMTCGAACATTTATGAAAC^ 

ATGGTGGTGTTGMGCCATCGCTACMGACAAMTGAAAMTACTACATCT^^ 

TGTGGAGACTGACTCATGATCCAMGTACAGGAMTGGGCCTGGGM 
30 GAGGCTATTCAGGCCTMGGGATGTrTACCTTCTTCATGAGAGTTATGATGATC 

TGAMTATTTGTACCTMTATTTTCTGACGACGATCTTCTTCCACTGGAGC 

CTATCCTCCCTAMGATAAAMGGMGTTGAMTCAGAGAGGAATAAAAMGACATTTATATTTTATTCTC 

ACTGTATACCTTAATMTTCCTTTTCTGGTAATCAGGCACATGATGMCTTTGATTAGTAGGTCTG 

GTTTTGCAGTCTTTTATGTTTATTATCAT^ 

AGMTTGTATAGCGATGACCTCTTMTTATAATTTGA 
TTMGCTGTMTGTTTTGCCATGTTGCAAAAA^ 

TGGTTTGTCTGTCCACCAGAGACAGATCTTCTGTGACAGCCTCCTTATGCAGGTCTATCATTAT^ 
ATACTTCACTCACATTGTMTTCAMTTAGAMGTCATTCC 
40 ATATTTTTGTTGGTTMTTATATTAGTGTTTTCTATTTO 

124 

TTTTGGTCTGAGMTGATGGACATTTAGACACTGGCGCCAGGTTTGCGCTGGACCGGCGCACGCAGG^ 
ACACACAGGTGGGCTACAGGTGTCACACGGCACCAGCCAGGGCCCGGGG 

AGGAGTCAGGTCMGTCCAGGTGGTCAGTGCCAGGGGTCCCAGGAGGGGAGGGCAGTGCATTAACCCTCCTGGTC^ 
45 ACCAGGCGGTCGTCACAGAAAGCAACCTCGGCCCGGGGCCG 

126 

MTTATATATTACATGTTTAT'TMGAGCACMCTTTTATGTAAMTTTACATTTM 

CTJX5GAAGACAGATGTGCATTGTTCTMTO 

GTOTACAMTTCTTGGCTAMGAGGCAGCTACAMGTTTACCACTAT 
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TTGAAMTCTACTGGTCTCTTTTACACCATTA 
ATTGTA 

128 

AATATAGTTATCTTCTTAAAMCCATTATAACMTTO 
5 GGTTCTTTTAMGCMTTTTCAAAATMGTACCMTTAAAG 

CCATCAGGTGACCAGCTGTTGCATTTCTTGCmTTCTC 

GTGMGGGTTTCTCTAGCTTTGGTTATGTGTMTGTTCACGTGACCTTTTTT^ 

TGTGCTTTATTACTMTMGTCCMTGAGTGAGTAGTAGCTAG 

CCTTTCAGAMTGCCTCATCTAMGATACATGATMTTTTG^ 
10 CCATTGCCAMTACTGAMTGGMGCCCGTCTTACCT^ 

CTCCTCACTTGTGGCCCAGTGCTCTTTCTGCTATACAAAATGTCTMTCTCAGAT^ 

GGATGMCTACAAAAMCCCATGATTM05TTTATGMTTCMGTMTMTTAGA 

TTATTGGATATGTTTGTMCACATAAACACAMGCACTTTTCAMCATGATGCACTTTTATC 

TTCCTCCTGGGATATGAGAMCATTTTAAAAMCGTATTTM(^ 
1 5 . TTATTTACTTAMTGTTTATMTATC 

TMTTGTTGCAGAGTTCTGCCTGTTACAMGC^ 

AGCTTCACATTAGAGTGATGTCATMTGTAAMTGTTTGC^ 

TCATCTTTTATTGTAMTATTTAGACTTTACCACAGAMTATTGGAACAGTTTC 
• TGGAGCTTGCCTTGTGCTTAGAMTAATATGTTGMCTATTT^ 
20 CTTCTTAAMTAGTGTGGTATTTCAMTATTGCTAGAGCTATTTTCCTGAMTACAT^ 

AGGMTATTTTTATTGATTGMGGAMTGACTGTACTGCGATTCAAMGTAMCTTATTTTATTA 

ACTCTATTTATACCTTMCATGAMTCCATGAC^^ 

MGTTAAAAACAGTAATGCCMCATTGMTTTATTTTTGAGGTC 

TGAGTCCATATACTATGTATGTTFACATATACTATACATGCA 
25 TAAAGTTCACTTCTTGAGCATCAATAAAMGGGAAGCTGTGTGGTTTTGG 

129 

MSLMVS AGRGLG AVWS PTHVQVTVLQ ARGLRAKGPGGTSDAYAVIQVGKEKYAT SVSERSLGAPVWREEATFELP SLLS SGPAA 
MTLQLTVLHRALIX^LDKFLGRAEVDLRDLHSSLGKSFFKTLKKRAWAIFLRLCLKKN 

130 

30 GTGGAGGCCGCCAGTCGCGGCGATCTTCTCCTCGCTTCTC 

GCCTGGGGGCCGTGTGGTCCCCAACCCACGTGCAGGTGACC&TG^ 

CGAGCGACGCGTACGCGGTGATCCAGGTGGGCMGK5AGMGTACGCCACCTCCGTGTCGGAGCGCAGCCTGGGCGCGCCCGTG 

GGCGCGAGGAGGCCACCTTCGAGCTGCCATCGCTGCTC 

ACCGCGCGCTGCTCGGCCTCGACMGTTCCTGGGCCGCGCC^^ 
3 5 TTTTTAAGACTCTTAAAAAMGAGCCTGGGCAATTTTTTTMGACTC 

TTCACTCTACAGGGGACAGGAGACCATGGATTGGACCCCAAAGGGATTGAACTGC 

TCTCCCTGCCCAAMGGAMCCCAMTTATTTGTGGGATACTGGGGAMTTGTAGTC 

AAMGTCAGTAGAAAACAGGTGCCTCAGCCTTCAAATGGTTGCTTCT 

TACCCCCTTGTTATAAMCTGTGCAGAGCAAGMGATGAT^ 
40 TTTTTTTTTTTAACTAGGG 

TTMGACTGGMTTAGAATGTTTCTGGTATTTCTC^ 

CTAGGGGMGMTATCACAGGCTMTAGCGTGGTTGGGGGTGMG 

GTCCTTGTTACATTGAGGTTMGAGACAAMTCATTGGCAGTGC 

ATCAGAGCACTTTMTCTGMGGATGATACTGTMCTTGATTT 
45 TTMTTCTCTCCTACAGTACTGGGACCACTGTAAACTTCTCAGATGACm 

CTTATAAAG AAAAC TGTATGTTC ACTTGAAAAAAAAAAAAAAAAAA 



131 
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MCAFPWLLLLLLLQEGSQRRLWRWCGSEEW^ 
QVSFLDPSYSLHISNLSWEDSGLYQAQWLRTSQISTMQQYNL^ 
WLSRGDSTYTFHEGPVLSTSWRPGDSALSYTCRANNPISWSSC 
GLVJVIRVQKRHKMPRMKKLMRNRMKLRKEAKPGSSPA 

5 132 

ACTAGGAGCCTCCTMTGCAGTCTTCTGCACAG 

ATGTGTGCCTTTCCTTGGCTGCTTCTTCTCCTGCTGCTCCAGGAGG 

GMGTGGTTGCGGTCCTTCAGGAGTCCATCAGCCTCCCCCTGGAMTACCACCAGATGM 

TCTCACAAMGTOTGCCACTGTGGTGCCAGGGAMGAGGGACATCCAGCTACCATCAT^ 
10 CMGTGAGCTTCCTGGACCCCAGCTATTCCCTGCATATCAGCMTCTGA^^ 

MCCTGAGAACATCCCAGATCTCTACCATGCAGCAGTACMTCTATGTGTCTACCGATGGCTGTCAGAGCCCCAGATCACTC 

MCTTTGAGAGTTCTGGGGMGGTGCCTGCAGTATGTCCCTGGTGTGCTCTGTGGAG 

TGGCTCTCCCGGGGGGATAGCACTOTACATTCCATGMGGCCCTGTCCTCAGCACATCCT^ 

TCCTACACCTGCAGAGCCMCMCCCCATCAGCMCGTCAGTTCTTGCCCCATCCCTGATO 
15 TATGCTTCTGAGMGCCTTCMCAGCCTTCTGCCTCCTGGCC 

GGACTCTGGGTCATCCGAGTCCAGAAAAGACACAAMTGCCMGGATGAAGAMCTCATGAGAMCAG 

GAGGCAMGCCTGGCTCCAGCCCTGCCTGACTGCTCCTTGGGAACCCCAGTCCTG^ 

TCCTTCCTCAGCTCTCTTCTTTCCAGGGGMGGAGGTGCTCAGGGGTGGGTATCCAGAGAGCCATACTTC 

CTGGCAATAAAGTCAAATTAAGTGACCACCAAAAAAAAAAAAAAAAA 

20 133 

MIEEKSDIETLDIPEPPPNSGYECQLRLRLSTGKDLKLVVRST^ 
PKDYWQVIVSQPVQNPTPVEN 

134 

ATATCAGCTTCCAGTGTATTGCTTGGCACCGCCMTCMCATGATAGAGGAAAAGAGCGACATAGAGACTCTGGATATTCCTGA 
25 GCCACCACCCAATTCTGGATATGAATGTCAGCTTCGTTTGCGCCTTTC 

AGACACAGTATTCCACATGMGAGACGGTTGCATGCAGCAGAGGGAGTGGMCCAGGTAGTCAGCGGTC 

ACCTCTCACTGACAAMTGMGTTCGMGAGCTGMGATCCCAMGGACTATGTTC 

GMCCCMCACCAGTGGAGMCTGMCTGAGCCCTGTTGGCCAGCTCCCACATCCCTCTGCTCCTTTO 

ATTTCCTACTCTGCGGCGTGAMTCTATTTGACTGCTCTAAATTCCOT 
30 MTTCCATCTGTGATGGAGACCMCAAAMTMTAAMCACAAAGAGCCAGGCTTTGAGACTCATGTMTO 

TCAMGGCAGOTMGAMTAGATAACCATTTATTTTAGMCACTC 

TAMTGAMCATTGCATCCATGAAGGACAGCACCAAGCACCTTTTTC^ 

TMTTTCCAGAMCCATAMTMTGGATATAAMCTTMCCTTTTTGTTTTGT 

TMTGGAMCTGAAMGAGCAGTATTTGAGGTTGCTTCTATO 
35 GAGTAGAGACTGGCAMTlGTGCTTTAGTGTTGCm 

TGGTGCCTTCCAGCACATTAGTGGCAGGCACCCTTCTGGMCAOTAGGCMTMTTO 

TCMCAGATACTCAGTTGAAMGTCGCTGTCATCTTGCTGCATMGTATTTTGAMGGTCTG 

CCAGGCTTAGMGGTCACTACTATATAGTACCTTCATTGATCTATCTATTCTGCTTC^ 

CAMTGCATGGTTCCTGGGTCAGGAGACACCAAMTCMTTATAGCTGTTCCCACTC 
40 ACTTTGCATGGAGGGGTGTAAGGAMGCCTATTTTATCTC 

TCMCTCAAAATTTTTMGTGTGAMGGmACTGGTGTCACATAAACATTAGCA 

TACCmGTGCTTTCCAGTTCACTGTTACCTCTAGTCATGGTAGATGA 

AATTAAGAGCCTTTTGCAAGTTAGATTAGGGGACTGAAAMTCCAGGGAG 

AGCTGCCMGAGTTAGGTTGMTTTGGAGCCTTTTAAM^ 
45 CCCCTGAGAATCCATGAGATGGAAMGTACAGACMGMGGCACTTAACGCTGTTCTCAGTTC 

CTGTTTGGATGTGMGTGCATTATTMTT™ 

MTTTCCTATTGCCTGATTAMTATATCTGTMTTGTMCATCATG 

TCATTTTAMGATTGTGGCTATATCTMTTTAAATTTC 
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AMGGAGATTTTTTTTATCTTTAGAG 
GTGTGCCACATCGMTAGCAGCACTTCTCAAAGTTTC^ 
GGTGATGTCATCMTATCTATGATATCGATTGGTAAAAGGTCTTTC 
TTATTTTAMGGMCATATGCTTTGMTMCMTTCTCTGGGCATGTTTM 
5 ATTGTAMTAGATTGATATMGATTGGACCAGTACTTG 
AAAAAAAAAGGTTCAATTTT 

136 9 

GGMGGTATTAGACAMCATGCCCATGAMCCCCAGCTTCCCTTTO 
TCCAMGTCATTTGGACTTCMGGCATTTAAMTCATTTTTCT^ 
10 MTTGGTGAGAGACCAGGCTGATCCTGGCAMGGTCTTGTGTCTTTCTGC^^ 
AGCCAGTCTGGGGACGCTGATATTGAGGATGAGCTGGGGGACTCTGCTCTC 
TGAGTGAGGTTGATTCTCTCTCCCTCTTTGC . . 

137 ' 

MCTSLSPffl)PWPLNPLSIQQTTLLL^^ : 
15 FQIRLGSCPIWLNGERAIHQALVQQGSAFADRPAFASFRWSGGRSMFGHYSEHW^ 

HVLSEMELVALLWGSADGAFLDPRPLTWAVANVMSAVCFGCRYSHDDPEFRELLSHNEEFGRTVGAGSLVDV^ 
PVRWFREFEQLNRNFSNFILDKFLRHCESLRPGMPRDMMDAFILSAEKKMGDSHGGGARLDLE 

TALQWLLLLFTRYPDVQTRVQAELDQWGRDRLPCMGDQPNL PYVLAFLYEAMRFS SFVPVTI PHATTANTSVLGYHI PKDTW 
FVNQWSWHDPVKWPNPE^DPARFLDKIXSLINKDLTSRVMIFSVGKRRCIGEELSKMQLFLF 
20 FSYGLTIKPKSFKVNVTLRESMELLDSAVQNLQAKETCQ 

138 

actctggagtgggagtgggagcgagcgcttctgcgactccagttgtgagagccgcmgggcatgggm™ 
acccccagtctcmtctcaacgctgtgaggamcctcgactttgccaggtccccaagggcagcggggctcggcgagcgaggcac 
ccttctccgtccccatcccaatccmgcgctcctggcactgacgacgccmga 
25 ggcagcaggtgtccaggccgggcctgcgggttcctgttgacgtcttg 

tgtcccgcgccactggaaaccgcacctccccgcagcatgggcaccagcctcagcccgaacgacccttggccgctaaacccgctg 
tccatccagcagaccacgctcctgctactcctgtcggtgctggccactg 

cgg^gctccggtccgcgcccccgggcccgtttgcgtggccactgatcggamcgcggcggcggtgggccaggcggctcacctc 
tcgttcgctcgcctggcgcggcgctacggcgacgttttccagatccgcctgggcagctgccccatagtggt^ctg • 
30 cgcgccatccaccaggccctggtgcagcagggctcggccttcgccgaccggccggccttcgcctcc 
ggccgcagcatggcottcggccactactcggagcactgc^^ 

acgcgocagccgcgcagccgccmgtcctcgagggccacgtgctgagcgaggcgcgcgagctggtggcgctg^ 
agcgcggacggcgccttcctcgacccgaggccgctgaccgtcgtggccgtggccmcgtcatgagtgccgtgtgtttc 
cgctacacxxacgacgaccccgagttccgtgagctgctcagccacmcgm 
35 gtggacgtgatgccctggctc^agtacttccccmc 

agcmcttcatcctggacmgttcttgaggcactgcgamgccttcggcccggggccgccccccgcgacatgatggacgccttt 
atcctctctgcggaamgmggcggccggggactcgcacggtggt^ 

actgacatcttcggcgccagccaggacaccctgtccaccgcgctgcagtggctgctcctcctcttcaccagg 

cagactcgagtgcaggcagmttcgatcaggtcgtggggagggaccgtctgccttgtatgggtc^ 
40 gtcctggccttcctttatgmgccatgcgcttctccagctttgtgcctgtc 

gtcttgggctaccacattcccmggacactgtggtttttgtcmccagtggt^ 

ccggagmctttcatccagctcgattcttggacaaggatggcctcatcmcaaggac 

gtgggcaamggcggtgcattggcgmgmctttctmgatgcagctttttctcttc 

ttcagggccaacccaaatgagcctgcgaaaatgmtttcagttatggtc^ 
45 actctcagagagtccatggagctccttgatagtgctgtccaamtto •■ • 

mgctgamttttagamtattcacatcttcggagatg 

tcaattagcgtttmggtgagcatamtcaactgtccatcaggtgaggtgtgctccataccca 

GCTATGCAGGAGCTTCTGGGAGATTTTTTTGAGTCAMGACTTAMGGGCCCM 
TATTTCTGMGGTAGCATTCTTTGGAGTTAAMTGCACATATAGACACATACACCCAM 
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GMGTATTTTGGTAACCAGGCCATTTTTGGTC 
CAAAGTTTCAGAGTATATTGTTGMGAGACAGAGACAAGTMTTT^ 
TAMGACCAGAMTTCCCTTTTCACCTmCAGGAA^ 
TGGTATACTTCCTTACTTTTAAGGATAMTCATAAAGT^ 
5 GGTTCCATCAGTTATCATGMTTTTAAAGTATGC^ 
AMGTACAGMTWMTTATCAGCAMGAAAAAAAA 

AGTMGTCTCATAGGTTAAAAAAAAMGTCACCAAATAGTGTGAMTATATTACTTMCTC 

ATCTTGTTCAGGAAMGGTTGMTMTATATGCCTTGTGTMTATTGAAMTTGAA 

AAAMTGAGCTTGATTAMTCMCCACCTATTTTTGACATC 
10 TCTCAAAATTAGATCCTMGATGTGTTCTTATT 

AMTMCCTMTTAATATATTAAAATTCCAMTTCATGGCATGCTTAMTTTTM^ 

GTTCCAGTTGMGTTAGTGGAAATCTGMCATTCTCCTGTGGMGGCAGAGAAAT^ 

AAMTGCCATGMTTACCTGGATGTTCTTTTTA 

TCTTCTAGATGATTTTTTGAMGTTMCATTM^ 
1 5 GAAMGACAGTGGAGATGAGGTCAGTTGTGTTTTTTMGATGGC 

ATGTATGCATTCACATTTAGAAMGTGAATTGMGTTTCMGTTTTAMGTTC 

AGTGTCCTMGTGCTMGTGCTTATTACATTTTATTMGCTTT^ 

ATAGTTGTGTGTATGTGTGTGTGGGGTGGGGGGATGGT 

TTTTCCCACTCATTCTGMTTMTTMTTTC^AGCACAAMTTCAMGCA 
20 AGTTAMTCTCAMTAGGCTATTAAAAMGTCTACMCATAGCAGATCTGTTTTGTGGTTTGGAA 
• ATTTTATTTTAAMTTTCATAGCTGTACTTCTTGM 

AMGCAGGCTTGCCCAGTACATTTAMTTT^^^ 

ttgggctgctgtagcctatttttttaga™ 
tcaaccaggtccagatgtgctatmtctc^ 
25 gtttatatggmgaaagtaaggtgcttggagtttacctggctt^ 
acaaaaaacgmtgaamtmctgmtttggaggctggagtm^ 
ccggagagagmtgtatttgctgacmccattamgtcagaagttttactccagg™ 
atgcttcatttgtatgtcamgctttgac^ 

gaattttctaamgctttcatgtcccagmcttagcctttacctgtgmgtgttactaca 
30 tatattagatcamtagttgcatagcagtatatgttm 

CTTTAGTAGACATTTATMCTCMGGATACCTTCTTATTTAATCTTTTCTTAT 

GTGCATMTAGCTACAGTGCATAGTTGTAGACAMGTACATTCT^^ 

ACCAMTTAAAAAAAMTTGTATCTCATTACTTATACTGGGA 

AAAA 

35 139 ' 

mdistrskdpgsaertaqkrkfpspphssnghspqot^ 
gritofycwvchregqvlccelcprwhakclrltsepegd^^ 
gtdafqkpvpleqhpdyaeyifhpmdlctleknakk^ 
evcpecylmcqkrdnwfcepcsnphplwaklkgfpfwpakalrdkdgqto 
40 sifnsmqmevyvenirrkfgvfnyspfrtpytpnsqyqmlldptnpsagtakidkqekvklnfdm 

grrislsdmprspmstnssvhtgsdveqdaekkatsshfsaseesmdfldkstaspastktgqagslsgspkpfspqlsapitt 
ktdktsttgsilnlnldrskaemdlke^ 

sedseksdssdseyisddeqksknepedtedkegcqmdrepsavkkkpkptnpveikeelkstspasekadpgavkd 
kdfsekmpsphpikdklkgkdetdsptvhlgldsdseselv^ 

45 PPETPVLTRS SAQTSAAGATATTSTS STVTWAPAPAATGSPVKKQRPLLPKETAPAVQRVVWNS S SKFQTSSQKWHMQKMQRQ 
QQQQQQQWQQQQPQSSQGTRYQTRQAVKAVQQKEITQSPSTSTITLVTSTQSSPLVTSSGSMSTLVSSVNADL 
DIAKYTSKVNGCNKRNNDRNIQRSF 

140 

GGAGTGTAGAGGAMTAAMTCATGTCTGTGGCTCATGACAGCATTCCATATTC 
50 GTGTTTTTMTAATACCATGTMCTG 



WO 02/46465 



PCT/G BO 1/05458 



317 



AACTGTGGAAAMTGAGACGGTTGCTTGAACATTTC 

GCATGGATATCTCTACTCGCTCCAMGATCCTGGCTCTGCAGAGAGMCAGCCCAGAAMGAMGT 

CTTCCMTGGCCACTCGCCGCAGGACACATCAACMGCCCCATTAAAMGAAAMGAMCCTGGCTTACTGAACA 

AGGAGCAGTCAGMCTMGACATGGTCCGTTTTACTATATGAAGCAGCCACTCACCACAGACCCTGTOATC 
5 ATGGACGGMTGATTTCTACTGCTGGGTTTGTCACCGGG 

CTMGTGTCTCAGACTGACATCGGMCCAGAGGGGGACTGGITITCT^ 

AGACCCAGAGTAMGCCATGACAATGCTCACCATTGMCAGTTATCCTACCTGCTCMGTTO 

CAGGGACAGATGCATTCCAGMGCCCGTTCCATTGGMCAGCACCCTGACTATGCGGMTACATOTTC 

GTACATTGGAAMGMTGCGAAAAAGAAAATGTATGGCTGC^^ 
10 GCATCATTTATMTGGGGGAMTCACAMTTGACGCAMTAG 

TCGAAGTATGTCCAGAATGmTCTAGCTGCTTGCCAAAAACGAGATM 

TGGTCTGGGCCAMCTGMGGGGTTTCCATTCTGGCCTGCAAM^^^^ 

TTGGACMCATGACAGGGCCTGGGTTCCAATAMTAATTGCTACCTCAT^ 

AGAGCATCTTCMCAGTGCCATGCMGAGATGGAGGTTTACGTGGAGMCATC 
15 CATTTAGGACACCCTACACACCCMCAGCCAGTATCAMTGCTGCTCGATCCCACCMCCCCAGCGCCG^ 

ACMGCAGGAGMGGTCMGCTCMCTTTGACATGACGGCATCCCCCMGATCCTGATGAGCMGCCTGTGCTGAGTGGGGGC^ 

CAGGCCGCCGGATTTCCTTGTCGGATATGCCGCGCTCCCCCATGAGCACAMC 

AGGATGCTGAGMGMGGCCACGTCGAGCCACTTCAGTGCGAGCGAGGAGTCCATGGACTTCCTGGATAAGAGCACAGCTTCAC 
CAGCCTCCACCMGACGGGACMGCAGGGAGTTTATCCGGCAGCCCAMGCCCTTCTCTCCTCMCTGTCAGCTCCTATCACGA 

20 CGAAMCGGACAAMCCTCCACCACCGGCAGCATCCTGMTCTTMCCTGGATCGMGCAMGCTGAGATGGATTTGMGGAGC 
TGAGCGAGTCGGTCCAGCMCAGTCCACCCCTGTTCCTCTCATCTCTCCCAAGCGCCAGATTCGTAGCAGGTTCCAGCTGAATC 
TTGACAAGACCATAGAGAGTTGCAMGCACMTTAGGCATAAATGAMTCTCGGMGATGTCTATACGGCCGTAGAGCACAGCG 
ATTCGGAGGATTCTGAGAAGTCAGATAGTAGCGATAGTGAGTATATCAGTGATGATGAGGAGAAGTCTAAGAACGAGCCAGAAG 
ACACAGAGGACAMGMGGTTGTCAGATGGACAMGAGCCATCTGCTGTTAAAAAAAAGCCCAAGCCTACAAACCCAGTGGAGA 

25 TTAAAGAGGAGCTGAAAAGCACGTCACCAGCCAGCGAGAAGGCAGACCCTGGAGCAGTCAAGGACAAGGCCAGCCCTGAGCCTG 
AGMGGACTTTTCCGAAMGGCAAMCCTTCACCTCACCCCATAMGGATA^ 

CAGTCCATTTGGGCCTGGACTCTGATTCAGAGAGCGMCTTGTCATAGATTTAGGAGMGACCATTCTGGGCG 
AAMTAAGMGGMCCCAAAGMCCATCTCCCAMCAGGATGTTGTAGGTAAMCTCCACCATCCACGACGGTGGGCAGCCATT 
CTCCCCCGGAAACACCGGTGCTCACCCGCTCTTCCGCCCAM^ 
30 CGGTCACCGTCACGGCCCCGGCCCCCGCCGCCACAGGMGCCCAGTGAAAMGCAGAGGCCGCTTTTACCGAAGGAGACTGCCC 
CGGCCGTGCAGCGGGTCGTGTGGMCTCATCMGTMGTTTCAAA 

AGCAGCAGCAGCAGCAGCAGCAAMCCAGCAGCAGCAGCCTCAGTCTTCCCAGGGGACGAGATATCAGACCAGACAGGCTGTGA 
MGCTGTCCAGCAGAAGGAGATCACACAGAGCCCATCCACGTCCACCATCACCCTGGTGACCAGCACACAGTCATCGCCCCTGG 
TCACCAGCTCGGGGTCCATGAGCACCCTTGTGTCCTCAGTCMCGCTGACCTGCCCATCGCCACTGCCTCAGCTGATGTCGC 

35 CTGATATTGCCMGTACACTAGCAAMTGATGGATGCMTAAMGGMCMTGACAGAMTATACMCGATCTTTCTAAAM 
CTACTGGAAGCACAATAGCTGAGATTCGCAGGCTGAGGATCGAGATAGAGAAGCTCCAGTGGCTGCACCAGCAAGAGCTCTCCG 
AAATGAAACACAACTTAGAGCTGACCATGGCGGAGATGCGGCAGAGCCTGGAGCAGGAGCGGGACCGGCTCATCGCCGAGGTGA 
AGMGCAGCTGGAGTTGGAGAAGCAGCAGGCGGTGGATGAGACCMGAAGMGCAGTGGTGCGCCMCTGCAAGMGGAGGC 
TCTTTTACTGCTGTTGGMCACCAGCTACTGTGACTACCCCTGCCAGCM 

40 AGTCAGCTACTGCTCCTCAGCAGGAAGCGGATGCTGAGGTGAACACAGAAACACTAAATAAGTCCTCCCAGGGGAGCTCCTCGA 
GCACACMTCAGCACCTTCAGAAACGGCCAGCGCCTCCAAAGAGMGGAGACGTCAGCTGAGAAMGCAAGGAGAGTGGCTCGA 
CCCTTGACCTTTCTGGCTCCAGAGAGACGCCCTCCTCCATTCTCTTAGGCTCCMCCMGGCTCTGACCAT^ 
MTCCAGTTGGAGCAGCAGTGATGAGMGAGGGGATCGACACGTTCCGATCACMCACCAGTACCAGCACGMGA 
CGAAAGAGTCTCGGCTGGACACCTTCTGGGACTAGCAGTGMTCGGGACACAMCCACCCACCCCATTGGG 

45 CGCCAGGAAAAGMGAMCMGAMGGCAGGAGMCAGCCACTTTCAGACTTGAAMTGAGAAMCCCTCAGTO 
CCCCGGCGCGGGGGCTGCTACACTACAGGACACCCAGCATCGGCTTTGACTGCAGACTGTTCACCCACAC 
TGGTGTAMTAATGTACAATTTGTGGATGTCATTGAATCT^ 
AAAAAAAAAAAAAGAAAAAGAAAAACGATTT 

141 

50 MALSSQIWMCLLLLLLLASLTSGSVFPQQTGQLAE^^ 
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142 

TCAAGACCCAGCAGTGGGACAGCCAGACAGACGGCACG 
TCCTCCTCGCCAGCCTGACCAGTGGCTCTGTTTTCCC^^ 

GAGCCAGGGCCAGCTGGATGCCCATGTTCCAGAGGCGAAGGAGGCGAGACACCCACTTO 
5 GCTGTCATCGATCAMGTGTGGGATGTGCTGCAAGAOT 
CCTGCTGCCCCAGMCATAGGTCTTGGMTAAMTGGCT 

144 

amgtccttgtmcattgagttacagggctttmctcct^ 

tmggccccgtagmgatgccaatcaamtaaactgcattcctgaggcactaggcmgamttamgctattcmctcctcmg 
10 . gcccagggactattgcggaagaggtgggcgcgtmgattgtmgggccgattttga^ 

aactaatcattcmgtcamggcacactgatgcaaaatcagtatatgggacccctgtgtctgga™ 
aagcctttmccmctccttccntamgggtt^ 

145 

MGLPRGSFFWVLLLbTAACSGLLFALYFSAV^^ 
15 l^SIPVLLWGDLFTPALWDRLSQHKAPYGWRGLSHQVIASTLSLLNGSESAKLFAPPRDTPPKCIRCA 
IDAHDYWRLNGAVIKGFERDVGTKTSmFT^ 
LDKGDRPHAYFGPEASASKFKLLHPDFISYLTERFLKSKIiINra 
YFERKMKPLIFYANHDLSLEAALWRDLHKAGI LQLYQR 

146 • 

20 GGGACGTCAGCGGACGGGGCGCTCGCGGGCCGGGGCTGTATGGGGCTCCCGCGC 

ACGGCTGCCTGCTCGGGGCTCCTCTTTGCCCTGTACTTCTCGGCGGTGCAGCGGTACCCGGGGCCAGC ■ 
ACCACATCATTTGMGCATTCTTTCMTCCMGGCA^ 

CTGGCCATTCAGCGGCACCCCCACTTCCGTGGCCTGTTCMTCTCTCCATTCCAGTGCTGCTGTGGGGGG 
GCGCTCTGGGACCGCCTGAGCCMCACAMGCCCCGTATGGCTGGCGGGGGCTCTCTC^ 
25 CTTCTGMCGGCTCAGAGAGTGCCMGCTGTTTGCCCCGCCCAGGGACACCCCTCCAAAGTC 

MCGGAGGCATTCTGMTGGGTCCCGCCAGGGTCCCAACATCGATGCCCATGACTATGTATTCAGACT 
■ AAAGGCTTCGAGCGCGATGTGGGCACCAAGACTTCCTTCTATGGTTTCACTGTGAACACGATGAAGAACTCCCTCGTCTCCTAC 
TGGAATCTGGGCTTCACCTC^ 

CTGAGATCGGCCATTCTGGXSCGTGCCiGTCCCTGAGGGCCTAGATAM 
30 ■ TCTGCGAGTAMTTCMGCTGCTACATCCGGACTTCATCAGCTACCTGACAGAMGGTTCTTC 

CATTTTGGAGACCTATATATGCCTAGTACCGGGGCTGTCATGCTGCTC^ 

GGATTCATCACMGCAACTACTGGAMTTTTCCGACCACTATT^ 

GATCTGTCCCTGGAAGCTGCCCTGTGGAGGGACCTGCACMGGCC^ 

GAGCGCTTTGCTTCTTCAAGAGTTGCGGCCCTGATCCTCTC^ 
35 TTGCCMCAGAGGGCACTGGGGTGMTTCMGATTTTCATCGAGGTCTG1TC 

TCCAAGMCTCTTCTGTATCTAAMCMTACATCTCMTC^ 

AACTTTAGGAMTGTCGGTGGAGTGTTCAGCMGATCAGACAGCAGTCCAGGTCAAAGGCAMCACACACGCTCCAGCCCAAAT 
CCTCCTGGTGGCACATCCTACCCCAGATGCTAMGTGATTCMGGACTC^ 
TAGGCTTACTTCTGCTCCATAATAMGTCGGAGAAAAMG^ 
40 ATTTTTTTTTTTTTTGGCTGGG 

GTCTGGCACTTGATGGGGGTTAGTTGCTTGGCAGCCTGCCT^ 
ACCMCAGGATGATGTCACCMCAGGATGATGTCACCAGGTMTA^CCTTCATCCTCAC 

• 148 

GGGGATMGTTTTTTTTTTTTCTTTTTTT^^ 
45 AGACTATTTAGAGTTGACATTTGACATTTTMTGGGCGCCATGGCTCATTTO 

AGTCTCATC^TGACAGCGTGCTGACAGCTGGGAGTCTGTGGCCTTCCTCACGCAGAGG 

CTAGGCTGGGCAGGGATGGGACCCATGCCCCCTCCTTAGAGGACGGGCTTC^ 

ATAATAGTTCACTGGTCACCGTGCTTTTATGAGTAGTGTTTTTC 
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GATTTMGCAATGGTGTCTGGAGm 

AAMTACACACAAGCAAAAAAAAAATGATAGTTTCACATCTCTTAGTTC 
CCAGGATTTTCCTACATAGTCAGAACTTACACAmCTAGAG^^ 
CCAGCCACAAATACCCACATTGGAMGACCCATTTGTGATGGGTAM 
5 TGCATGTGTTCATGACCTC^ 

CTGMGAGGGGGATGMGTGGGTCTCCAGGGAGGCAGTGGGGGCCTTGTTGGCAGCTGGCTCGG 
AGCTCTGCAGTTGGGAGGGGCACCGTCCGGAGGAGACCAGGCCTCTACACACCCCCCACTCTACTTATCATCCCTGCTC 
CCCTTGTCCMGGCTTTATGCATCGGATTTATTTTTCCAMTCGAGAGGACAGTGATA 
AMGGTCGCTAMTGTATACTGTTGTCAGMTTGCTGAGATCTCCCCCCACOT 
10 CTGTGACTTATTTTCTCTCTCAGGCAGCCAGCCACCTGGTC ' 

ccaggggtgaccgcaggat(^tgggggcagcgggcttctccacctaccccagccaccm 

cttcagcacatccctgtgcacagctggmgggtgcatggcccgctcacctc^ 

ctcctccctgccgtgcccctgccacggagcaggcattgtgmctggct 

mcccacagtggagactggagacagggcaatgagtctggtcgggggcacgtggacatc 
15 acaggcmggtcctgggcattgcgcgacgcaggactcmtgctaaagcmgcctgcctggctctgtgccagggcccctcttc 

atttacacatcccatttttacacagacccttccttc 

agacagggagtgaggggcctttgtgcccmctccagcacagctgcgttct^ 

tggtggtctcgtgagacagttccgaggac6ggk3aaattgcagggtggtgggggcgtgag 

ttcgcctgcagacc&atctggatataca^^ 
20 agattatatgtmggctctgmgggagagggagatgtacattctgccaggctcctggggaccttatgcgag 

gactgttgatccagtggtgcmgaagctacactccatgtgtcatcacgcttatgactcctmtc 

tcagccgactccatcttcacccctcgattcctcgagtccagcctttctgtgccag 
• tcgggacccggctgggcctggagtctcggggcacagttgccat^ 

ctgaaaaccccacaggagatggagtacctogccaagcttaaagagmgattttctcagggtat™ 
25 gtcaggmgcaggatggamgatgcattcagactgtt^ 

ttmgtttgggacttattttcccatttgagmgttat^ 

gcttcmtgagtttaaggagcactmggctmtgataccmto^ 

cagtamtattttcttctactgmcatggagccattattmgagttgtgtgttttt^ 

ttgtttgatgttctatttttctmt^gttttc 
30 mtcaggttggtgtctgctgggtctctcctgctttt^ 

atgtgamctgccctgcctcccctttto 

mctcttttgtggtacattatctcatgcttctgcamttc^ 

150 

mgatctatgtcctggggggccgccagggcmgctcccggtgactgcttttgm 
35 cggcatccmgcctacccagccgtcgggcctttgctggctgcgccatggctgmggca 

cagcctgggccccacmcttctactctcgcccacactto^ 

ctgttctgggctgtcctcccgctctctgtgggatggaggggcatagtc 

ggtcctggaccamttgccccgcagcctgcgcatgagggatmgagggcagactttgtggttg 

tggccattgggggccttggtmgtctctatggggctggggagaggagggagtcccmgacagg 
40 gtgtcaccttctgcccatctccaggcactccaggggtcagggcto 

ggtctgcctggcccagcactcagggtgttcctcagtgactgggggctctc 

ggacamtgaccactgtatgcacagtcmgagmcatatagamcataaatgmtgmcamtgatgmcata 

gcmggccctgatagctcatcctggggtgatmctactgggctattctctgggcttagggagcag 

ggcctgtgcctctcacagccttctctc^ 
45 cacggcggcgctgggaggcattgcctgccatgcccactgcccgctgcto 

ttattgggggtgtggcccagggccccagtcmgccgtggaggca 

gtccactggagcagctcattgccagaggcagctattot 

tgagagaggcatgggggtgagcacttgamcactgccttggggccttgggttaggggagcct^ 
. , atatgcttacacctacctwatcaccattcgttcatgmtcatgcctagctccatccttc 
50 tccmctgggamtggggagmgcamgctggccccatgctcttcagggtcagttcctatc 
gttgccattcctgaaamtctcagctgccaggctgcctttagggtccctgtagaccc^ 
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GAGAGMTAGAGAGGATGTGGGMCTGCCAGAC&GCCGGAGC 

ACTGCTGGTTGTATGACCTTGGACMGTCACTTCACCTCTC 

TTCCTACCCTACCTACCTCACAGGGCTCTTG^ 

151 

5 MVRSRLTAVSASVA?QMPPADMGRRKS1G*KPPPKKKM^ 
LSEPVDVYSDWIDACEAANQ 

152 

GGCACGAGGGGCTCATCCAC€TGCAGACATGGGGCGCAGAMGTCAAAACGAAAGCCGCCTCCCAAGAAGAAGATGACAGGCAC 
CCTCGAGACCCAGTTCACCTGCCCCTTCTGCMCCACGAGAMTCCTGTGATGTGAAMTGGA 

10 .CATCTCTTGTACCGTGTGCCTAGAGGMTTCCAGA 

AGACGCCTGCGAGGCGGCCAATCAGTAGCGACACAGAGGACCCGCCCCCTGAGCAGCCCCGCGTACTGTGGATCCAGCTGTTCG 
GTTCTGGTCCAGAGACATTCCAGGGGTCCAGGGTGTGGGTCCTGGGCTGTCACAGCCGTGTGTGTGTGTGTGTGTGTGTGTGTG 
TGTGTGTGTGTGTGTGTAGTGGGTGTGCGTGTGGGTGTO 
TGGCCGTGAGTGTGTGCTCACAGGTACGCGGTGGTGTCGGGTO 

15 MGCCTTTGCTGTGTTCCCTGCAGCCCCTGGCCCCCCAGC 

TGACATCCTGGACAAGGGAAGCCCACTAGAGGCTGGAACAGGACCTCTCCAGCCTCCTCACCAGCACCGTGCCCATCTCAACTG 
GACTTCCCGCCCTCCTTCTCCACCTTCTAGTGCCCGTGGCCGGGGATTCAMGCCGCCGTTCCCCAGGTCCCTGGGCTGG 
TGACAGGGAGCCGCCCCCCTCCCCATGGTMCCAGGMGCCCGTTTCATG^ 
• ATGGGGACAGCTGTCMTCACAAGCCCTTAAATAAAGCAGCCAGCGCACAAAAAAAAAAAAAAAAAA 

20 153 

MPPPSDIVKVAIEWPGMAQLLEIDQKRPLASIIKEVCDGWSLPNPEYYTLRYADGPQLYITEQTRSDIKNGTILQLAISPS^ 
ARQLMERTQSSNMETRLDAMKELMLSADVTFATEFINMD^^ 

SITFIKQIAGWSQP^VSII^RSIAILESmNSQSLYQKIAEEITVGQLISHLQVSNQEIQTYAIALINALFLKAPEDKRQ 
DMANAFAQKHLRSIILNHVIRGMPIKTEMAHQLYVLQVLTFNLLEERMMTKMDPNDQAQRDIIFELRRIAFDAE 
25 GTEKRKAMYTKDYKMLGFTNHINPAMDFTQTPPGMLALDNMLYLAKVHQDTYIRIVLENSSRE 
LQVGELPNEGRNDYHPI^FTHDRAFEELFGICIQLLNKTWKEMRATAEDFNKVMQW 

SEILRLRQSERMSQDDFQSPPIVELREKIQPEILELIKQQRLNRLCEGSSFRKIGNRRRQERFWYCRLALNHKVLHYGDLDDNP 

QGEVTFESLQEKIPVADIKAIVTGKDCPHMKEKSALK^^^ 

ELTKSDLDTLLSMEMKLRLLDLENI QI PEAPPPI PKEPS SYDFVYHYG 

30 154 

GGAGGGAGGTGTAGAMGAGGTACATGGAGAACMGTTTGTCAATC 
GTCTGTGTTTTTGTCTCGCAGMTTAGAGCCCATTGGGMC^ 

CCAGGTGCTMCGCCCAGCTCCTTGAMTCGACCAGAMCGGCCCCTGGCATCCATTATC 
TTGCCAMCCCAGAGTATTATACCCTCCGTTAT^ 
35 MTGGGACMTQTTACMCTGGCTATCTCCCCGTCCCGGGCTGCACGCCAGCTGATGGAGAGGACCC 
ACCCGGCTGGATGCCATGAAGGAGCTGGCCMGCTCTCTGCCGACGTGACTTTGGCTACTGAGTTCATC 
ATTGTGCTGACMGGCTCGTGGAMGTGGMCCMGCTCmTO 

CCTAGAGCTCATGGACCATGGCATTGTCTCCTGGGACATGGTTTCMTCACCTTTATTMGCAGATTGC 
GCCCATGGTGGACGTGTCAATCCTTCAGAGGTCCCTGGCCATCCTGGAGAGCATGGTCTTGAACA 
40 GATAGCCGAGGAMTCACCGTGGGACAGCTCATCra 

GATTMTGCACTTTTTCTGMGGCTCCTGAGGACAMCGACAGGATATO5CA 

MTCCTGMTCATGTGATCCGAGGGMCCGCCCCATCAAMCTGAGATGGCCCATCAGCTATATGTCCTTCMGTC 
TMCCTTCTGGMGAMGGATGATGACCMGATGGACCCCMTGACCAGGCTCAMGGGACA 
TGCATTTGACGCAGAGTCTGATCCTAGCAATGCCCCTGGGAGTGGGA 
45- MTGCTGGGATmCCMCCACATCMTCCAGCCATGGACOTACCC 
GTACTTGGCTAMGTCCACCACX2ACACCTACATC 
TGGCCGCAGTGCCATTGAGCTCACCAAMTGCTCTGTGAMTC^ 
CCACCCGATGTTCTTTACCCATGACCGAGCCT^ 
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gatgagggcaacagcac^ggacttcaacmggttatgc^ 
ctctttggatcagttcmgagcamttgcgtagcctc 
ggatgacttccagtccccgccaattgtggagctgagggagmgatccagcccgagat^^ 
gmccggctctgtgagggcagcagcttccgamgattgggmccgccgmggcmgmcggttctggtac^ 
5 gmccacmggtccttcactatggtgacttggatgacaacccacmggggaggtgacatto 
tgttgcagacattmggccattgtcactgggamgattgtccccacatgamgagaamg 
gttggmttggccttctccatcctgtatgaccctgatgagacctt^ 
gattgatggcctcagtgcccttctggggaaggacatgtccagtgagctgaccm 
gatgaagctgcggctcctggacctggagmcatccagattcccgmgccccaccccccatc^ 
10 ctttgtctatcactatggctgagcctggagccagamcgac^ 
attctggtgccttgtcttttgcttgtacagm 

cggaccacccagagtttcctcttggtccctgtctactmgagtcatgmggcagggtc 

ctgggattgggccacgaggmcamcagcagatgcccttgccttccagtccmgamctgcttcttc 

gccactcaccttttcctcctgagcctgctc^ 
15 cagaggacaggtgtgttgggcgtgttgagagccttgaagtgactacctgtatc 

ttttgtcttttttgatgagggctcactc 

gtctttagagccaccagatctctctggcccatacagatatcaga 
. cagcaagtcamgacctcctggccmgtcctgccctcttm 

ctagactgtggtgaggtcactagattctactgctctto 
20 cctgaattgcagccccctccagtttgctttcctttggccttccagaccccaggmgttg 

tctgccattggccatgatttcagggagctggctgaggccggctgaggccacacctgtgccagtc 

cacttgtaaaccacacacacagcctctctccctggacatacgttagcacattggcattcagtattgg 

tactacccmtgmgagtgtactatatattttcattactataggccatacttatacagac 

tacctatcttaggatggmccttggggaaamtaam 
25 ttgtaaccagagagcagccaggagcttcctgtcagtmccatgttttcaatamtactctttcatgt 

156 

AGAGCMTACTCMGGGAAMGGTGTTTATTTAGMCMTGAA 
TGTGCATAAAAATGGTGAAAATATTGGACTTTCTTGGCAAMGMGAM 
CATTTGTCTTTGGTTTGTGTATTATGTGTATGATTTTGAAA 
30 ATC 

157 

MNHQQQQQQQKAGEQQLSEPEDMEMEAGDTDDPPRITQNPVINGWALSDGHNTAEEDMEDD^ 
VLSPPCFVRNLPWKIMVMPRFYPDRPHQKSVGFFLQCNAES 
35 NFMAWSEVTDPEKGFIDDDKOTFEVFVQADAPHGVAWDSK^ 
SKSVPLALQRVFYELQHSDKPVGTKKLTKSFXSl^^ 

VDYRSDRREDYYDIQLSIKGKKNIFESFVDWAVEQLIX5DNKYDAGEHGLQEAEKGVKFLT 

INDRFEF PEQL PLDEFLQKTDPKDPANYI LHAVLVHSGDNHGGH YVVYLNPKGDGKWCKFDDDVVSRCTKEEAI EHNYGGHDDD 
LSVRHCmmWIRESKLSEVLQAVTDHDIPQ 
40 VKYTVFKVLKNSSLAEFVQSLSQTMGFPQDQIRLWPMQARSNGTKRPAMLDNEAre 
GATLPKFDKDHDVMLFLKMYDPKTRSLNY^^ 
LDELMIXSDIIVFQKDDPENDNSELPTAKEYFRDLYH^^ 
KSQGYRDGPGNPLRHWEGTLRDLLQFFKPRQPKKL^ 

CKKAVELGEKASGKLRLLEI VSYKI IGVHQEDELLECLSPATSRTFRI EEI PIiDQVDIDKENEMLVTVAHFHKEVFGTFGI PFL 
45 LRIHQGEHFREVMRIQSLLDIQEKEFEKFKFAIVl^RHQYINEDEYEVNLKDFEPQPGNMSHPRP 
YLEKAIKIHN - * 

158 

GTACGTGCGCGTCTCCCTGCCGCCGCCGCCGCCCGCCGCO 

(^GGAGGCCGCCCCGCraCCGCCGCCGCCGCCGCCGCCCCGGCTCGCCGCCGCCCGCCCGCCGGGCTCGCAGCCCCGGCCCCC 



WO 02/46465 



PCT/G BO 1/05458 



322 

ggccgcaggcgaggcccaggccgcggccgacatgaaccaccagcagcagcagcagcagcagaaagcgggcgagcagcagttgag 

cgagcccgaggacatggagatggmgcgggagatacagatgacccacca 

ggccctgagtgatggacacmcaccgcggaggaggacatgg^ 

tgtggagcgcttcagcagactgagtgagtcggtccttagccctccgtgt^ , 
5 gccacgcttttatccagacagaccacaccaaaamgcgtaggattctttctccagto 

gtcttgccatgcacaagcagtgctcmgatmtaaattacagagatga 

cttccataaagaaaatgattggggattttccaattttatggcctggagt^ 

tgacamgttacctttgaagtctttgtacaggcggatgctc^ 

cggcttaaagmtcagggagcgacttctoacatgaacagcctgctacagacgt 
10 gtacatgatgccaaccgagggggatgattcgtctaamgcgtcc^ 

tgataaacctgtaggmcaaaamgttmcamgtcatttgggtgggamcttc 

gctogtcgagtgttgctcgatmtgtggaamtmga^ 

camtggtgtcctatatccagtgtaaagmgt^^ 

camggamgaaaaatatattomtcatttgtggamtgtggcagtagaaca 
15 ggmcatggcttacaggmgcagagaaaggtgtgamttcctaacatt^ 

gtatgaccctcagacggaccaamtatcmgatcmtgataggtttgmttcccagagcagttaccacttgatgm 

aaamcagatcctmggaccctgcaaattatattcttcatg 

ggtttatctamccccamggggatggcamtggtgt 

tgagcacmttatgggggtcacgatgacgacctgtctgttcgacactgcactaatgcttacatg™ 
20 aamctgagtgmgttttacaggcggtcaccgaccatgatattcctcagcagt^ 

cgaggctcagaagcggaaggagcggcaggmgcccatctctatatgcmgtgcagatagtcgcagaggaccagttto 

ccmgggmtgacatgtacgatgaagaaamgtgamtacactgtgttcamgtattgmgaactcctcgcot 

tcagagcctctctcagaccatgggatttccacmgatc^ 

accagcmtgttagatmtgmgccgacggcmtaamcmtgatoagctcagtgataatgaaaaccct^ 
25 ggamcagttgatcccgagctggctgctagtggagcgaccttacccmgtttgataaagatcatgatgtmtg 
gatgtatgatcccaamcgcggagcttgmttactgtggg^ 
agttatgtgtgacagagcac&atttattcmgatactagcctto 
tcaggactatgacgtgtctcttgataaagcccttgatgmct^ 

amtgatmcagtgaattacccaccgcamggagtatttccgagatctctaccaccgcgttgatgtcat 
30 mtccctmtgatcctggatctgtggttacgttat 

cmcacagatccmtgttgctgcagittttcmgtctcmggttatagggatggcccaggtmtc 
aggtactttmgagatcttctacagttcttcmgot 
agactttgagmcaggcgaagttttaaatgtatat^^ 
gcatgggtgtgtccgggacctgttagmgmtgtaaamggccgtgga 

35 AGAMTTGTAAGCTACAMATCATTGGTGTTCATCAAGMGATGMCT^ 

TCGMTAGAGGAAATCCCTTTGGACCAGGTGGACATAGACAMGAGMTGAGATGCTTGTCACAGTGG 

GGTCTTCGGAACGTTCGGMTCCCGTTTTTGCTGAGGATACACC^ 

CCTGCTGGACATCCAGGAGMGGAGTTTGAGAAGTTTAMTTTC 

CGAGTATGAAGTAAATTTCAMGACTTTCAGCCACAGCCCGGT^ 
40 CMCAMGCCCCAMGAGGAGTCGCTACACTTACCTTGAAMGGCCATTAAAATCCATMCTGATTTCC 

AGGCGAGGACGGTGTGTGGGTGGCCCCTTMCAGCCTAGMGTTT^ 

TTCGCTGCTGGTGTTMTTCTATTTTATTGAGGCTGTTC 

GMGATGTTTTTATAAAGCTTCGATGCCMTGAGAGTT^ • 
AGMGAGGTTAGTGGATCGGGGGTCCCTGGCTCMGGTCTC^ 
' 45 GGGTCCCGTGCACCAGCCCTGCAGCTAGCMGTOT 

CITTTGTGCATCTGTTMGGCAAMCAGAGAAACTCACAAOT 

159 

MATFSGPAGPILSLNPQEDVEFQKEVAQVRKRITQR^ 
SKGYAFVEFESEDVAKIVAETMNNYLFGERLLECH^ 
50 ERLLRKKIiAKKGIDYDFPSLILQKTESISKTNRQTSTKGQVLRKKKKKVSGTLDTPEKT^ 
NDDDKDDEIVFKQPISCVKEEIQETQTPTHSEIKKRRRSSNQ 
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160 

TGGGAGCCCGACGTTTCCGGGAGCGCCGCGTGGTTAGCGTCGGCGGCTTTTGGCATGGCGA 
ATCCTGTCGCTTMTCCGCAGGMGATGTCGAGTTTCAAMGG 
CMGMCAACTTACTCCTGGAGTAGTCTATGTGCGCCACCTACC^ 
5 CAGTTTGGCACTGTGACACGGTTCAGGCTGTCCAGMGTAAAAGGACTGGAM 
TCTGAGGATGTTGCCAAMTAGTTGCTGAMCMTGMCMCTACCTGTTTGGTC 
CCTGAAAMGTACATAAAGMCTCTTTAMGACTGGMTATTC^ 

CGGMTCGGACACTMCACAAAAGCTACGGATGGAGGAGCGATTTAAAAAGAAAGAAAGATTACTC 
AMGGMmACTATGATTTTCCTTCTTTGATm 
10 GGCCAGGTTTTACGTMGMGMGAAAAMGTTTCAG&TACTC 
CCAGTTTGTACACCMCATTTTTGGAGAGGCGAAMTCTC 
TTCAMCAGCCCATATCCTGTGTAAMGMGAMTACMGAGACTC^ 
AGCMTCAGTGATTTTCAATGTATTATATTTCTTTTGAA 

162 

15 GTAAAAA1X5ACTTGGATTGAAAATATGTGGTAGCCTTTTTATTTCTACATTMG 
CAAAACCCATATGTGCAGTACTTTACTACTTTGGGAMGCTGCATCTT^ 
TTGMGAGGGTTCTG^TACGTTATTGTAGTTCCCAGTTTAATATAGTTC 
ACACTCTGGAMGTAATMTTACATCATMTCATTTATTTTTTAMCT^ 

ACATCTCAGAGCAATTTGGTTTTGGTGTATATGTTCTCMCAGAAMCCAGTGTTMTGMTATCATGCCTCAGCACTGTCACT 
20 TTTAAMCCTGTCAGGATCCCACCGTAAMTTGG 
GTCMTATCCAGGTTTATTGTGTCCTACTAm 

TATAGATTMTGACTGGAMGTTTTMTATATTTAGCCCAAAGGTTACTTTGMTTG 
GGTTTATTTGTTTAGCTTTACMGGTACACATMGTTAGGT^ 

GCTTCCTGTCACAGGTAATGGAAAACATMGTAGMTAGGTGACCTCTTAGTTTTGAACTTATTTMGTGTGGGGAT 
25 TCATCAGMGTGCTTACAGGGTTACTACCTGAGTTTACMTCTACCTGGTCATTATTTTATTTCT^ 
CTCCACTGTTTATATATTCATAATTAMCACATTGAGAATGGAACA 
TGTGTGTTTTTTGTTTTTTTC 

CTCACTGCMCCTCCGCCTCCTGGGTTCACGCGATTCTCCTGCCTCAGCCTCCCGAGTAGCTGGGATTACAGGCACACACCACC 

ACGCCCGGCTMTTTTTTGTGTCTTTTTAGT^^ 
30 GATCCGCCCGCCTCGGCCTCTCAMGTGCTGGGATTCTGTGTGTTTTGTG 

TACAGGATGGTCTMTMCATGAAMCAGGCTAGTTTCAAGCMCAGCMTGTCGGTTGGAAAGCAGGCGTCATTTGCCTTGM 

AAMGCCTTTTGACMCATACAGGCATTCTTTTAAMCCAGGCTGAMC 

TTCTTAMCCTAGCACCTCTGTGTATTTGAAMTMTGAGACATCTTTCATTC 

MCCTCACTACCAMTGAGTGAMGCTTGATTMGAGTTCTTC^ 
35 AGMGGACAGAAAGGACTATTTTAMGTTGGACTGMGGAGAAAAMGCAAMTTCTTGTTTCA 

MGTTAMCCCCCGTMTCTTAMGAGAAMTCTTTGGAGGTTTTMTTAMCATTT^ 

CTTTAAGTGTCMTGTAGCATGTAAMGGCTTTGTACAGACAGGTAAMGTTCCATTO 

CATCTTTMCTTGAMTCAAMCTATCAGATTTTATTTTTGTATMm 

ATTGGCTTCTACAGATCMTMTTTAMTGTMCTAGTTGGG^ 
40 TCTGTAMTTGTMTAMTATATTTGCMTTATTAAATGTTAAGTGAT 

163 

. . MRDPPNRVPPTTEGTRGLLSCLPDVERATLTLLLDHLRLVSSFHAYNRMTPQNLAVCFGPVLLPA 
AVDFKHHIEVLHYLLQSWPDPRLPRQSPDVAPYLRPKRQPPLHL^ 

RDFLSGPDYDHVTGSDSEDEDEEVGEPRVTGDFEDDFDAPFNPHLNLKDFDALILDLERELSKQINVCL 
45 164 

ATGGCCCGGGACCCCCCAMCAGAGTTCCCCCCACCACTGAGGGCACCCGAGGGCTCCTCAGCTGCCTC 

GCCACGCTGACGCTTCTCCTGGACCACCTGCGCCTCGTCTCCTCCTTCCATGCCTACMCCGeATGA 

GTGTGCTTCGGGCCTGTGCTGCTGCCGGCACGCCAGGC^ 

GCAGTGGACTTCMGCACCACATCGAGGTGCTGCACTACCTGCTGCAGTCTTGGCCAGATCCCCGCC 
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GATGTCGCGCCTTACTTGCGACCCAMCGACAGCCACCTCTGCACCTGCCGCTGGCAGACC^ 
GGTCGAGGAGGCCCCGAMGCCCCCCGAGCAACCGCTACGCCGGCGACTGGAGCGTTTGCGGG 
CGGGATTTCCTGTCCGGGCCAGACTACGACCACGTGACGGGCAGTGACAGCGAGGACGAGGACGAGGAGGTCGGC 
■ GTCACCGGTGACTTCGMGACGACTTCGATGCGCCCTTCMCCCGCACCTGMTCTC 
5 CTGGAGAGAGAGCTCTCCMGCAMTCMCGTGTGCCTCTGAGCCAGATGACGGGGTGGGACCCCGGTTAG^ 
CCCAGTGGCTMGGCGGTGCCCTGGTGACCMGGAGAGCCAGACCTGTTGCTCAGGCCGAG 
A(^GGACCAGTCGCGTGTATGGCTGAGACTCATTCCCAGTTTC 
TGGGGATTTTAGGGACCAGCGGTTGTGACCATCTTTCCTGAGCACCM 
CGCTTGCTAGGCTGGCCTCTCTTGCCTCCCCTTGGCCGGGGCMCACCAGTTACTGTGAGCATC 
10 CTCTAGTCGGCCCTCTTGCTGCTGCCMCCAMTCAGTATTAGCTTTGAGCACTGCACTGTT 
MGACTAGCATGAGGCACTCTTTGTGGGGGGCAGCCCCTATCCTGGGTTCCA 
AGAMCAGCTGGTTTCCCCTACCCTTTCCCGGGGMGACCCCACGATTGGCCTCTAGTCAGC 
CCTTTCCMTCCCCATCTCCTTGCCCCCCCC 

GCCCCCTGMGCGTCAGGCTGGGGGCAGTCTGGTACGGAACATATTTATTGCCTCCATGCA 
15 TGGTGTGCGTGGACACGTCTGMGCAGGCGTGTGGGGCTCTTTC^ 
. CCCTGAMTCCCCMTMTGGTGCCTCAGTGGGCCCC^^ 
TCCGCACCTCCATCTTTGTGGATAATAAATAAATATGCACAGGT 

166 

TITTAGGATGGGGTGTGGGAATGAGGGTGTGTGTAGGGAGGGGGTGCGGGGTGGGGACGGlAGGGGAGCGT 
20 TAGTGTCATGCCTCTTTCACCACCACCACCACCACCGMGATGACAGCMGGATCGGCTAMTACCGCGTGTTCT 
GTGGGMCTTACAGATGACAGTTCTTGCATGGGCAGMCGAGGGGGAC 

GMGGAGAGACAGCTTCAGGAAGAAMCAAMCACGMTACTGTCGGACACAGCACTGACTACCCGGGTGATGAAATCATCTGC 

ACACTGMCACCCCCGTCACMGTTTACCTATGTCACMTCTTGCACATGTATGCTTGMCGACAMTAAMG 

AAGA 

25 168 

TTTTTTTTTGTATTGTATACACAGTGGAAAGCTGGTTTTATTTGGG 
ACGAGATCAGTCTTGCTTTTTAGAMGATTAGTTTGGCAGTTACTTATTTGTMCCAGM 
GGGAGMGTCAGGTGACTATTAGTCTGCGAGTMTTCTGGGACMGAGCA^ 
CCAGGTTTTGGCATAAAT 

30 169 

MDFPCLVfoGLLLPLVMLDFNYHRQEGMEAFL^ 
DETVGRELLLHLIDYLVTSDGKDPEITNLINSTRIHIMPSMN^ 
PEWAVMKVifoKTETFVLSMLHGGALVASY 
.TNGYSWYPLQGGMQDYOTIWAQCFEITLELSCCKYPR 
35 RKHICPYRTNKYGEYYLLLLPGSYIIOTTVPGHDPHITKVIIPEKSQNFSALKKDILLPF 
DHSAATKPSLFLFLVSLLHIFFK 

170 

GCATTTCTTCCTTCTGCGTATGGGACAGGArc 

CCGTGCCTCTGGCTAGGGCTGTTGCTGCCTTTGGTAGCTGCGCTGGATTTCMCTACCACCGCCAGGMGGGA 
40 TTGMGACTGTTGCCCAAMCTACAGTTCTGTCACTCACTTACACAGTAmGGAMTCTGTGAMGGTAGAM 

CTTGTTGTGGGGCGGTTTCCAMGGMCACAGMTTGGGATTCCAGAGTTCAAATAC 

GTTGGGCGGGAGCTGCTGCTCCATCTGATTGACTATCTCGTMCC^ 

AGTACCCGGATACACATCATGCCTTCCATGAACCCAGATGGATTTGMGC^ 

AGGGAAMTTATMCCAGTATGACTTGAATCGAMTTTCCCCGATGCTTTTGM 
45 GTGGCAGTCATGMGTGGCTGAAMCAGAGACGTTTGTCCTC 

TTTGATMTGGTGTTCMGCMCTGGGGCATTATACTCCCGMGCTTMCGCCTGATGATC 

ACCTATGCTTCAAGAMTCCCMCATGAAGAAAGGA 

TACTCTTGGTATCCACTCCMGGTGGMTGCMGATTAC^ 
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TGCTGTAMTATCCTCGTGAGGAGMGCTTCCATCCT 

CACCTAGGTGTAMGGGTCMGTTTTTGATCAGAATGGAMTCCATTACCCMTGTMTTG 
. - . ATCTGCCCCTATAGMCCMCAAATATGGAGAGTATTATCTCCTTCTC 

GGACATGATCCACACATCACAMGGTGATTATTCCGGAGAAATCCCAGAACTTCAGTGCT^ 
5 TTCCMGGGCMTTGGATTCTATCCCAGTATCAMTCCTTCATGCCCMTGATTCC 

GCTGCMCAMGCCTAGTTTGTTCTTATTTTTAGTGAGTCTTTTGCACATATTC 

CACATCACCACCTGGMTCAGGGATTGCTCACTCCAGGTTACTGCAACCCTMCTCACTCTAGTGGGAC 

TCCACGATCTTCCTGMGMGAGAMTGGATGTTTCC^ 

GTCTTGACGGTGMTGGMGACACTTACCTAACMGTACTGCTCATTTACACTCAAA 
1 0 TMGMGTTAAAACTTGAGAAGCAAAAMTGCCTGCAAAMGMGATCATTTTGT^ 

GMGATGMCATTTATTGATCTTCTACATACMGACTTCACCATAAGGCCAGGAGCAGTGCTC 

TTGGGAGGCCMGGTGGGCGGATCACCTTGAGGTCAGGAGTTC^ 
. MTATTAGCGGGGTGTGGTGGCGGGCACCTGTAGTCGCAGCCTTTCGGGAGGCTGAG 

CGGAGTTTGCAGTGAGCCGAGATAGTGCCATTGTACTCCAGCTTGGGCAACAGAGTMGACTC 

15 172 

AGGTTTGAGGGGGGATGCTGTCATGAAMGGMCTTTCGCTTTTCAGGTTATGTATGT^ 
CACATGCTGCTTCTGTGTMCACTGATTTGGTCA^ 

TATCCTTCACACCAAACTTAMTCACTGGCTACTGAGMTATGCAGTCTTTGCGCACMTO 
CAGGTCCCAGACACAATTTCTTCATGGCTTCTTCAGCCTTTCCACCCTCA 
20 ACCACACAGGGACACATCATCATTCTTTATGGMGMGGTGAGATAGAMCAMTMTGAGGTGTm 
.MTCCTAGACCTTAGGATCTTGAAATGGCCCCTTTTC^ 
GGAGAGCTCAAACTCATGAATAACN 

174 

tmtamgtgctgmtatctgtcatttattgmttgaatactgtacattacatcaaaa 
25 gtcmtatgtcatgacgttgcaccatcagmgttggggaccatctgtataagtt^ 

agcatgamgagttamcctccctccccacacatacacagox:acagggctgctgcgctgggtgggag^ 

176 

anntanagtttgagggamggaggtmgmtoacacamtmgtmtgtcatgtgcctaamtgactgmgattcagtam 
ccamtggaaaaaaatctcatttttttmttccnctcat^ 

MTAAAGGGAMTACAGCCTTTGTGMTGGGMGGMCTGTGGTTTCAGTTGCAGA^ 

TTGTCAGGCAGACACATGCACACGTATGTTTATTGCGGTACTATTCACMTAGCAMGACTT ■ ■ 

178 

AGGATGGGGTGTGACCTTAATCTACTGACAAAGACAAMGAAMTTTCMTTATATC 
35 AGGGAMCAGTATACCCCCCAGTMTGCAAMGAGGTATMTGGTCCTAGMGTGATAGGGTTGTGGACM 
CATTMCTGTTCAMTGTAAAGATCATMGGAGTGGGCACCCACTATTGAAMGAGGTGGGGCATCCTO 
ACCAACATAAATGCAATTATGCATGTCMCMGCCMGAAMGGGGACTTAGGGGGTCAGCAACCCTGG 
AGGGTGAGMGGTMGMCTTTGTTTMTTMGGATGAACCCTGTGGGTGATO 

TGTGG ... 
40 180 

CTATCATGMTTTTTGTTTCATTACAMTGATAGAAM 
TTATAMTGTTTATGMGCTCATTACATTATCTTTTTTAAAAMGTAAA^ 
CTTTTGATATAGATTTGAGGAMGAGTCTCCAGTTTGCA 
45 GGTGACACCTTCCNCTATAATTTCATTACCCGGATTGTMGGTTTCCCCC 

181 
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mgqqpgkvlgdqrrpslpalhfikgagkkessrhggphcwfveqalq^ 
valydfvasgdntlsitkgeklrvlgynhng^^ 

vresesspsqrsislryegrvyhyrintasdgklyvssesrfntlaelvhhhswadglittlh 

wemertdio^khklgggqygevyegvwkkysltvavktlkedtmeveeflkeaavmkeik^ 
5 tygnlldylrecnrqevl^vvllymtqissameylekknfihrdlmrnclvgenhlvkvad 

ikwtapeslaynkfsiksdvwafgvllweiatyg 

aeihqafetmfqessisdevekelgkqgvrgavttllqapelptktrtsrraaehrdttd 

llprkergppegglnederllpkdkktnlfsalikkkkktaptppkrsssfremixjqperrgageeegrdisngalaftpldta 

dpmspkpsngagvpngai^esggsgfrsphlwkksstltssria^ 
10 dlqstgrqfdsstfgghksekpalprkrage^sdqvtrgtvtppprlvkkneemdew 

wapasglphkeeawkgsalgtpaaaepvtptskagsgaprgtskgpaeesrvrrhkhssespgrdkgklsklkpapppppaas 

agkaggkpsqrpgqeaageavlgaktkatslvdavnsdaakpsqpaeglkkpvlpatpkphpakpsgtpispapvplstlpsas 

salagdqpsstafiplistrvslrktrqpperasgaitkgwl^^ 

qqmrnkfafreainklennlrelqicpasagsgpaatqdfskllssvkeisdivqr 

15 182 

atggggcagcagcctggaamgtacttggggaccamgmggccmgctoc 
mggmtcatcgaggcatgggggtccacactgcmtgtttttgtggmcm 

cctcagggtctgagtgmgccgctcgtogmctccmggaamccttctcgctggacccagtgaamtgaccccm 

gttgcactgtatgattttgtggccagtc^ 
20 cacmtggggmtggtgtgaagcccaaaccaaamtggccmggctgggt^ 

gagamcactcctggtaccatgggcctgtgtcccgcmtgccgctgagtatccgctgagcagcgggatcmt^ 

gtgcgtgagagtgagagcagtcctagccagaggtccatctcgctgagatacgaagggagggtgtaccattacaggatcmcact 

gcttctgatggcmgctctacgtctcctccgagagccgcttcmcaccct 

gacgggctcatcaccacgctccattatccagccccaaagcgcmcmgcccactgot^ 
25 tgggagatggaacgcacggacatcaccatgmgcacmgctgggcgggggccagtacggg^ 

aaatacagcctgacggtggccgtgmgaccttgmggaggacaccatggaggtggm 

amgagatcamcaccctmcctagtgcagctccttggggtctgcacccgggagcccccgttctatatcatcactc 

acctacgggmcctcctggactacctgagggagtgcmccggcaggaggtgaa^ 

atctcgtcagccatggagtacctagagmgaaamcttcatccacagagatcttgctgcccgam 
30 cacttggtgmggtagctgattttggcctgagcaggttgatgacaggggacacctacacagcccatgct^ 

atcamtggactgcacccgagagcctggcctacmcmgto 

gamttgctacctatggcatgtccccttacccgggmttgaccgttc^ 

mgcgcccagmggctgcccagagmggtctatgmctcatgcgagcatg 

gctgaaatccaccaagcctttgamcmtgttccaggmtc^^ 
35 gtccgtggggctgtgactaccttgctgcaggccccagag 

gacaccactgacgtgcctgagatgcctcactccmgggccagggagagagcgatcctctg^ 

ttgctccctcgaamgagcgaggtcccccggagc^ggcctc^ 

ttgttcagcgccttgatcmgaagmgmgmgacagccccmcccctcccamcgcagcagctccttccgggagatggacggc 

cagccggagcgcagaggggccggcgaggmgagggccgagacatcagcmcggggc^ctggctc^ 
40 gacccagccmgtccccamgcccagcaatggggctggggto 

tctccccacctgtggmgmgtccagcacgctc^ 

mgcgcttcctgcgctcttgctccgtctcctgcgttc^ 

gacttgcagtccacgggmgacagtttgactcgtccacatttggagggcacaaaagtgagm 

gcaggggagmcaggtctgaccaggtgacccgaggcacagtmcgcc^ 
45 gatgaggtcttcamgacatcatggagtccagcccgggctcca^ 

accgtggcccctgcctcgggcctcccccacmggmgmgcot 

gtgacccccaccagcamgcaggctcaggtgcacc^ 

aagcactcctctgagtcgccagggagggacmggggamttgtccaagctcaaacctgccccgc 

GCAGGGMGGCTGGAGGAMGCCCTCGCAGAGGCCCC&CCAGGAGGCT^ 
50 ACGAGTCTGGTTGATGCTGTGAACAGTGACGCTGCCMGCCCAGCCAGCCGGCAGAGGGCCTCAAAAA 
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t 

ACTCCAAAGCCACACCCCGCCMGCCGTCGGGGACCCCCATCAGCCCAGCCCCCGTTCCCCT^ 
TCGGCCTTGGCAGGGGACCAGCCGTC1TCCACTGCCTTCATCCCTC 
• CCTCCAGAGCGGGCCAGCGGCGCCATCACCMGGGCGTGGTCTTGGACAGCACCGAGGCGCTGTGC 

TCCGAGCAGAT(K5CCAGCCACAGCGCAGTGCTGGAGGCCGGCAAAMCCTCTACACGTTCTGCGTGAGCT^ 
5 CAGCAAATGAGGAACMGTTTGCCTTCCGAGAGGCCATCMCAMCTGGAG 

TCAGCAGGCAGTGGTCCGGCGGCCACTCAGGACTTCAGCMGCTCCTCAGTTCGGTGM 
TAGCAGCAGTCAGGGGTCAGGTGTCAGGCCCGTCGGAGCTGCCTGCAGCACATGCGGGCTCGCCCATACCCA 
AGMGGGACTAGTGAGTCAGCACCTTGGCCCAGGAGCTCTGCGCCAGGCAGAGCTGAGGGCCCTGTGGAGTC 
CTACGTTTGCACCGCCTGCCCTCCCGC^CCTTCCTCCTCCCC 
10 CCGTGGACTGCAGTCGGCATGCCAGGACCCGCCAGCCCCGCTCCCACCTAGTGCCCCAGACTGAGCTCTCCAGGCCA^ 
ACGGCTGATGTGGACTGTCTTTTTCATTTTTTTCTCTCTGGAGCCCCTCCTC 

AGMTGGMGCCTGAACTGAGGCCTTGTGTGTCAGGCCCTCTGCCTGCACTCCCTGGCCTTGCCCGTCGTGTGCTC 
TTTCMGMCCGCCATTTCGGGMGGGCATGCACGGGCCATGCACA 
C£TGCACTCGCCATTTCCTCACGTGCAGGACAGCTCTTC 
15 CTGGGCAGGTGGGAGCTGAAMGGATCGAGGCATGGGGCATGTCCTTTCCATCTGTCCACATC 
TCTTGTGACGTGCACTGTGAATCCTGGCMGAMGCTTGAGTCTCMGGGTGGC 

CAGCAGAATGGAGGCAGGGGACAAGGGAGGCAGTGGCTAGTGGGGTGAACAGCTGGTGCCAAATAGCCCCAGACTGGGCCCAGG 

CAGGTCTGCMGGGCCCAGAGTGMCCGTCCTTTCACACATCTGGGTGCCCTGMGGGCCCTTCCCCTCCCCCA 

ACAMGTAGATTCTTACMGGCCCmCCTTTGGMCAAGACAGCCTTCACTTTTCTGAGT^ 

20 CCTCTGTGTAGCCGCCCTGAGAGAGAATAGAGCTGCCACTGGGCACCTCGCGACAGGTGGGAGGAAAGGGCCTGCGCAGTCCTG 
GTCCTGGCTGCACTCTTGMCTGGGCGMTGTCTTATTTMTTACCGTGAGTGACATAGCCTCATGTTCTGTGGGGG 
GGAGGGTTAGGAAAACCACAAACGGAGCCCCTGAAAGCCTCACGTATTTCACAGAGCACGCCTGCCATCTTC 
CCCCAGGCCGGAGCCCAGATACCGGCGGGCTGTGACTCTGGGCAGGGACCCGGGGTCTCCTGGACCTTGACAGAGCAGCTAACT 
CCGAGAGCAGTGGGCAGGTGGCCGCCCCTGAGGCTTCACGCCGGAGMGCCACCTTCCCGCCCCTTC 

25 CAGCCTCGCACAGGCCCTAGCTTTACGCTCATCACCTAMCTTGTACTTTATTTTTCTC 

TTTATGCGGTTCTTACAGCACATCACCTCTTTCCCCCCGACGGCTGTGACGCAGCGGAGAGGCACTAGTCACCGACAG 

TGMGACAGAGCAMGCCCCCACCCAGGTCCCCCGACTGCCTGTCTCCATGAGGTACTGGTCC 

TGCCACTATATTTTACACGTATCTCTTGGTATGCATCTTT^ 

TGCCCTCGGGGGCCTGTGGTGGCTCCCCCTCTGCTTCTCGGGGTCCAGTGCATTTTGTTTC 
30 TTTTGAATCCAMTCTGTCCTCTGTAGTATTTTT^ 

184 

GTAGAAAAAMGGTCTCACTGTGCTCAGGCTGGTCTTGAGCTCCTGTCTGGGCMCTTGGCTAMCCTCAT 
ACGAAAACTAGCTGGGTGTGGTGGTGTGCATCCGTGGTCCCAGCTACTAGAGAGGCTGAGGTC 
AGGTCMGGCTGCAGTGAGCCATGATCTCATCACTGTACTCCAGCTTGG^ 
35 ACAAAAACAAAMCCAMGMTCTAATCTATCTAGGCMCTTCCAGACCTTAGGTTTGATCCCCACTTTGTCAC 
GTATGATGTTGGATCTCMTTTCCAAACAGTGACATGAGTACCATMCCTTTCAAAMGTATCTATC^ 
GCTGGCTATTGTTCTNAGTGCTTTGGGACAGATTMGTAAMCCAAGCAMCCAMGATCTCTGGCCCTC 
CTTGGTGAGGGCCCATTTACTTTTTCCTTCTTTTT 

186 

40 MCTTATMCATTTTGGGTTTATATTTMT^^ 

MCATTAMTCTCTTTTCCATGTCMTGTCTATAGTTTTTTTT^ 

TGMTACATTTNTCTAMTGTTTTTNCCC 

TTMCTNGGGGGGGACMCAAMCAAAMTCTMTTTAA^ 

187 

45 MEPQEERETQVMWLKKIFGDHPIPQYEVNPRTTEILHHLSERNRVRDRDVYLVIEDLKQKASEYESEAKYLQD 

ANLS STG SRYLNALVDSAVALETKDTSLASFI PAVNDLTSDLFRTKSKSEEIKI ELEKLEKNLTATLVLEKCLQEDVKKAELHL 

STERAKVDNRRQNMDFLKAKSEEFRFGIKAAEEQLSARGMDASL^ 

VKIEEAKREliDSIEAELTRRVDMMEL 
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tagaatmtgtctacaagaggatgtcmgamgcagagttgcatctgtctacagamgggc 
acatggactttctaamgcamgtcagaggmttcaga 
atgcttctctgtctcatcagtccttagtagcactatcaga 
mttggagtcctatttagacttmtgccgmtccgtctcttgctcm 
5 ttgmgotgmcttacmgmgagtagacatgatggmctgtgacaamgccamtamcatc 
tgmtaggactttacagagttcttt^ 
mttgatagaatatggtttctoctgtgtgttgmttt™ 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 

190 

10 tttttttmcaamgaamgatamgtttattcac^^ 

gatttacmgcctaccttmtmgggtaaaggagcagggaggaamggcttctactggaagmca^ 
gacamtgtatttttaggagmcamgcaggagacmgatggttcttcatcatgm 
agccgtgamctcctccagamgggggctagtggmgtctc^^ 
gacamgcctctttctgcatctgttgattctcamtgtcttcagg 

15 192 

GAGAAMTGTATTTMTMTTATTAAAMTAAAAACAGCM^ 

AGTCTAGMCTGGCAGAGGATGGGACCCGTCMTTTAAAAAGCACTGCTTCTCCAGAACACCCTCAGAGGGGAAGCGAGGCAAA 
CCTGGGAACTAGAGCACCCAAGGGCCACAGGGCCTNNTACCCAAGGGAGACAGGGACAACCCCTGGGGCAGGGCNTTAAANCAC 
AGGTGAAGGMGCACACCTTTCAMGCAGCTGATGACTCTTGAMTCCTGAGGAMCACCCCTC 
20 GGCTCTGCACTTTCTTTGGGNTGCCNTACCG 

194 

GGCATTACMACCTTTTTGCACATMCAAT^ 

TGTGTGGATATACATMTTTGTTTMCCMTCCCCTATATTTTAATATTACATTT ' 
GGATGMTMTGACCTTGATTATTTCTTAAMTMGCCCCTAAM 
25 NCTCACTCTTCAAAGCTACCCTCTAGACCCTTATCTO 
ATGGGGCTTTCTCAAG 

196 

GGGTCMCAGMGAGACTGCTTACTGTTTATTMGTACACATTAAMCCTMGTGGTCATTTAM 

TTMTTTTATTACATATATGAACATAAAATGTATGAAAACAACTGCA • 
30 AGTTCTTACGCATGAAGTAGTATATTACATCTCTTGAAA 

GAAAGAMGAAACTGTGGTMGTCMGATGTACCTCTGCTACCCCTGACACAGCMGACCM 
ATGATGATCCCCTTTTTACTTMCGAGTTGTAAAACCCTAMGTAMTCCCTCCAM 

198 

TTTTTTTTTGTTTCTGTTTTGTTT^ 
35 GTTTAMTAAMTTACAAGGCAAMCTACCCCATCCCMGTGCGGTTGCTGTACATATCTACAGTACM 
GCCATTTMGMCATAGAMCTTTCTACAGACMTTTCACATACAGMTACGTACACCTTT^ 
CTTACAGAGAGCATTGTATTTTACATTTGCCTGCACGTTTCm 

ATTTATTAAAGTGACCATMGGTGCCCGANGCGGGTTT^ • 
GGGTGGGACAGGCTTTCGGGGACCTTTCTTCCCCTGGGNCACACGGGCAG^ 
40 GCTTTTNCTTTT 

200 

TTTTACATACTAGATTTATTAMGTGACCATAAGTGC^ 

CAGMTTGGTGGMCAGGClTrCGGGMCTTTCTTCCCCTGGTCACACGGCAGATC 
ACTTTTGCTTTAAGMGGATTTGTCTNGAGAMGGACTAGAGAGAMTC 
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ACTGGGTNGGAATGGAGCAGGGGCTAGAGGAATCTGGMGCTAATTCTm 
TCTGGGTCTTGAAGTTTGGAGCTTACGGAGGCGA 

201 

MNGKRPAEPGPARVGKKGKKEVMAEFSDAWEETLKKQVAI^^ 
5 NLDFHEKYNDLYKFQQSDDLKKRREPHVSTLRKILFEDFRSWLSDISKIDLEST1DMSCAKYEFTDALLCHDDELEGRRIAFIL 
YLVPSWDRSMGGTLDLYSIDEHFQPKQIVKSLIPSWNKLVFFWSPVSFHQVSEVLSEEKSRLSISGl^HGPSLTRPPNYFEPP 
IPRSPHIPQDHEIIiYDWINPTYLDMDYQVQIQEEFEESSEILLKEFLKPEKFTKVCEALEHGHVEWSSRGPPNKRFYEKAEESK 
. LPEILKECMKLFRSEALFLLLSNFTGLKLHFLAPSEE 
PEENETKKESSVPMCQGELRHWKTGHY^ 
10 RDRETLKFVKH INHRS LEQKKTFPNRTGFWDFSF I YYE 

202 

CACGATAAAGGGGACATGCCGGGAGTTGCAGTACCCTCAGGMGMGTCATTGTCATGGACATGGACCOT 

GATCCCAMCTTCATCCAAAGCCAAGACTTCTTAGMGG 

GATTTGMGMGAGAAGAGAGCCTCACATCTC^^ 

15 tctaamttgacctggaatcmccattgacatgtcctgtgctam^ 

ctggmgggcgccggattgccttcatcctgtacctggttcot 

atagatgmcactttcagccgmgcagattgtcmgtctcmtc^ 

gtgtcctttcaccaggtgtctgaagtgctgtctgm 

actcggcctcccmctactttgmccccccatacctcggagccct^ 
20 mccctacttatctggacatggattaccmgttcamttcaagmgagt^ 

cttmgcctgagamttcacgaaagtctgtgaggccttggagcatc^ 

aggttttatgagamgctgaggagagtmgcttcctgagatattgmggagtgcatgmg™ 

ttgctgctctccmcttcacaggcctgmgcttcatttcttggccc 

gamccactgatatcactgaagaagggactagccatagtcctcctgagccagagmtaatcagatggccatcagcaacaacagc 
25 cmcagagcaatgagcagacagacccagagccagaggaamtgamcaaagaaagmtcmgtgttcccatgtgccaaggggaa 

ctgaggcattggmgaccggtcactacactttmttcat^ 

ggctgtgaaggctgggagccagmtatggcggttttacttcttacattgccaaaggtc 

ccagamgcmttctttggcattggtctacagagacagagagactctgamtttc 

caamgaamccttcccamcagmcaggtttctgggacttttcattcatctattatgmtgacagcactc 
30 aaamtgtgacccttcgtmttactgggmgtctgaaagagctmgcatggagtcmggagaactacatggtagcttgcctgac 

agtgttcttaaaactggttgtcttttactaggactcatmtgattgtcctc 

tctcttgattaaaaaaaaaaaamgttggctttttttto 

tttmgttcmtttgtttttattgaggtaamtatttatmcataamc 

tggattmgtacattctcattgttgtccagccatcaccatcatccatctccagmgttttc 
35 ttgmcmtmctccccacctccccttcccctagcmcagccatactttttc 

catgtmgtggmtcatacagtattotccttttgtgactggtttcacttagcatamgtctttmga 

gtgtttcggtttttttagaaamctcatacgtgattgcagccgggcatggtggctcac 

CMGGCAGGCGGATCACCTGAGGTCAGGAGTTAMCACCAGCCTGACAAACATGGAGAMCCCCATCOT 
ATTAGCTGGGCGTGGTGGCACATGCCTGTMTCCCAGCTACT 
40 AGGTTGCAGTGAGCCGAGATTGTGCCACTGCACTCCAGCCTGGGCMCAAGAGTGAAACTCTGTCTC 

204 

GAGTAGGAGTGCCTCCTTGTCTGCACTGCTGGTATGGGGTTAGGCCAG 

TCCCTGGGGAMGGGAACAGAGTMGGCAGGCCTTGTTCTCACTGCCCTCTMGGGMCTTGGTCA 
CAGTTTCTCCAGTTCMTMTMGGACMGAGCTTTTCCCATGCATTCTCTTTCCC 
45 TAGMTTGAAMGGGAGAGTGTCTTCAGTGCMTGTGGCATCCTGGATTGGGTC 
ATTGGAMTCTGAAAAAAGTCTGMTTTTAGTTMTATACCMTTTC 
AGATGTTGACCTTGGGGTAGGCTGGGTGMGGGTATACAGG 
CATTCCAAMTAAMGTTTATTTAATTTAAAAAAAAAA 
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MNKFPRRI PQKSC PRI LCWSCQEVSPEVADAICQAI VLS APGPHAVLLVTQLGRFTO^ LVFTR 

KEDIAGSSLEDYVRETNNQAUWL^^ 

QERQVSQGQGSEDVPGEESWLEGLSQIQKESEEAHRCLLGKADL 

206 

5 GAAMCATTTTGCTGAAMTATMGCAMCATCGGCCTTGTCCTCCTTGTGTTCATA 

CCGTGAGAGTGCGTGGCCGGGAGACCAGAMCGTGGTCCTTTCTCTTGCCTGTG^^ 

TGAACAMTTGCCCAGGAGMTCCCCCAGMGAGCTGTCCCAGGATCCTGTGCTGGAGCTGTC 

CAGACGCTATCTGCCMGCCATCGTCTTATCCGCCCCAGGGCCCCACGCCGTGCTCCTGGTGACACMC 

ATGAGGATCAGCAGGTCX5TCAGGCGCCTGCAGGAGGTCTTTGGAGTGGGGGTTC 
10 AGGMGACCTGGCTGGCAGCTCCCTGGMGACTATGTGCGAGAGA 

CACGGCGCCATTGCGGCTTCMCMCAGGGCACAGGGGGAGGAGCAGGAGGCCCMCTO 

CCATTATGTGGGAAAACGMGGAGATTATTACAGCMCMGGCTTACCAATATACCCA 
• AGGAMGGCAAGTAAGCCAGGGCCAAGGCTCTGAGGACGTGCCTGGTGAGGAGTCTTGGCTGGAAGGACTGTCCCAGATCCAGA 

AGGMTCTGAGGMGCCCACAGATGCCTGCTGGGGMGGCTGACCTTO 
15 CCTTTGCACCCCCCTGTGTCCAGCCCTCTGTTTCTCTTTCCATCCCATGGAGTG^ 

GTAGGMGAGGGGCATGGGGCTTGAGGACAGGGTCCAGCATGCCCAGATCACATTCCAGTTC 

TCTGGGTCCTCCCTCCTCTGTGCCCCMCAGCACGCTGTCTTCCTATTGGACC 

TACTTGTCTATACTCTGATAGMCCAMGTTTCTTGAGGGC^ 

CTGCATTGGAGTCATGCAGTCACGTTTGGTGCGGCMCTCCATGCMGCTGTTGACCA 
20 ATGATGGTTTMGGTGCCTGTTGTGTCTGGTGGAGACAGTGCTGGGAGATGA 

CTGGACACTCATCCACTGTTGGAGATGCTCMCAGCTAGACACAGAT 

TCCCCCATCTTCCCTAGACACAGCAGACATCTGAGAMGCTTCAGCATTTCTCTTGCTMCAGATTCA 
CAGAGCACAGAGTTATTTGGTGTTTGCTGAAGACAGCCTTTGTGCCACMTCACTO 
ACTGMCATGCMCATTTATCATACATTCAGCTCTCATO 
25 TTCTACTTTTCTACTCCTGTATGMTAAMTATTGAT^ 
AGTCTTCATTTTTATATTTACCTTGTTATTCTCAGGCAT 
CAAMTCATATTGGTATTCTAGTTGGCMTGTC 

MGGCCATTTTATTTCTAAG0X5MCAGACTTGAMCTCCAGAGCTACTGMGTAAMGTTAGMTCA 
ATAGGAGATMTTTTGTAMTTTTGATGCTATTATTT^ 
30 GACCAMTGAGATCCATTCAGCGACTMCTGGCMGGCACTCAAMCATGTC 
AAAAAAAAAAAAAAAAAAAA 

207 

MVVIDVKMLSGFTPTMSSIEELEMGQVMKTEVKITO 
LTN 

35 208 

GGCACGAGGCCGCCTTGGGGTGTGTGCCTGCCCCGCTGCGATGGAGGTCCTMGA 

TAGATTGCAMTGACCTGCTTTTTTTCTGATCCCGAGCTGCGTTCGGACCCCTGTCG 

CGTGGGCAGATCTGAGCTTTCTTCTIH^ACACCCCATACCCACAGTC 

AGMGGCATCTGGATTTTCTCTTTCCTTGGAMTAG 
40 ACACTGGMTTCGCMTAMTCCAGTATGGTGGTTATAGATGTAAAMTGCTATCAGGATTTA 

MGAGCTTGAAAACAAGGGCCAAGTC^TGAAGACTGAAGTC^ 

GAGCAGACAGTTTCACTTTTTCTGTTGAGC 

ACGAAAMGATGGAGAAGCATTTCTTTTMCAMCTGATTC 

GTGAACAGTCTGCAGTGGGOTATGGTTTCTTGACMGTCTTATTO 
45 AAAAAAAAAAAAAAAAAAAACTCGAGACTAGTTCTCTCTCTCATAAAATTAATTAACCGCTCATTCA 

TCCMCATCTCCGCATGATGAAACTTCGGCTCACTCCTTGGCGCCTGCCTC 

ATGCACTACTCACCAGACGCCTC7VACCGCCTTTTCATCMTCGCCCACATCACTCGAGACGTAM 

TACCTTCACGCCAATGGCGCCTCMTATTCTTTATCTG 
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TACTCAGAAACCTGAAACATCGGCATTATO 
CAMTATCATTCTGAGGGGCCACAGTMTTACAMCTT 

TGAGGAGGCTACTCAGTAGACAGTCCCACCCTCACACGATTCTTTACCTTTCACTTCATC 
GCAGCACTCCACCTCCTATTCmCACGAMCGGGATCAAACMCCCCCTAGGMTCACCTCCCATTC 
5 CACCCTTACTACACMTCAAAGACGCCCTCGGCTTACTTCTCTTCCTTCTCTCCTT 
CTCCTAGGCGACCCAGACMTTATACCCTAGCCMCCCCTTAMCACCCCTCCCCA 
GCCTACACMTTCTCCGATCCGTCCCTMCAMCTAGGAGGCGTCCTTGCCCT^ 
CCCATCCTCCATATATCCAMCAACAMGCATMTATTTCGCCC^ 

CTCATTCTMCCTGMTCGGAGGACMCCAGTAAGCTACCCTTTTACCATGATTGGACMGTAGCATCCGTA^ 
10 • ACMTCCTMTCCTMTACCMCTATCTCCCTM 
AAAAAAAAAAAAAAAAAAAAAAA 

209 

MATVQQLEGRVMjVDSKGFDEYMKELGVGIALRM^ 
TVCNFTDGALVQHQEWI)GKESTITRKLKDGKLWECVMNNVTCTRIYEKVE 

15 210 

accgccgacgcagacccctctctgcacgccagcccgcccgcacccaccatggccacagttca^ 
. xtggtggacagcamggctttgatgmtacatgmggagctaggagtgggaatagctttg 
ccagattgtatcatcacttgtgatggtaaamcctcaccataaam^ 

CTGGGAGAGMGTTTGMGAMCCACAGCTGATGGCAGAAAMCTCAGACTGTC 
20 CATCAGGAGTGGGATGGGMGGAMGCACMTAACAAGAAMTTGAAAGATGGGAMTTAGTGGTGG 
GTCACCTGTACTCGGATCTATGAAAAAGTAGMTAAAMTTCCATCATCACTTTGGACA 
CTCAGTTCAATGAGCAMTCTCCATACTGTTTCm 
ACATGCAGCTATTTCAMGTGTGTTGGATTAATTAGGATC^^ 

212 

25 GAGAGMCTAGTCTCGAGTTTTTTTTTTTTTm^ 

ATAGTGTTAGAGTTTGGATTAGTGGGCTATTTTCTGCT 

TGCTTGAGTGGAGTAGGGCTGAGACTGGGGTGGGGCCTTCTO^ 

TGCCTGCTGCTGCTAGGAGGAGGCCTAGTAGTGGGGTGAGGCTT^ 

AGTTGGAGTGTAGGATAMTCATGCTMGGCGAGGATGAMCCGATATCGCCGATACGGTTGTATAGGATTGCTTC 
30 CTGTGTTGGCATCTGCTCGGGCGTATCATCMCTGATGAGCMGMGGATATMTTCCTACGCCCTCTCAGCCG 
GGMTAGGTTGTTAGCGGTMCTMGATTAGTATGGTMTTAGGMGATC^ 

ctgagtttatatatcacagtgagmttctatgatggaccatgtmcgmcmtgccacagggatgmta™ 
ctagtttgmgcttagggagagctgggttgtttgggttgtggctcagtgtcagtt^ 

TGMTATTGTTGTGGGGMGAGACTGATMTAMGGTGGATGCGACMT^^ 
35 TAGGGTTMCGAGGGTGGTMGGATGGGGGGAATTAGGGMGTCA 

ACCTTAGCCAAACCATTTACCCAMTAMGTATAGGCGATAGAAATTGAAACCTGGCGCAATAGATATAGTACCGCAAGGGAAA 
• GATGAAAMmTAACCMGCATMTATAGCMGGACTMCCCCTATACCTTCTGCATMTGM 

AAGGAGAGCCAAAGCTAAGACCCCCGAAACCAGACGAGCTACCTAAGAACAGCTAAAAGAGCACACCCGTCTATGTAGCAAAAT 

AGTGGGMGATTTATAGGTAGAGGCGACAMCCTACCGAGCCTGGTGATAGCTGGTTGTCCAAGATAGAArc 
40 TAMTTTGCCCACAGAACCCTCTAAATCCCCTTGTAMOT 

AMCCTTGTAGAGAGAGTAAAAMTTTMCACCCATAGTAGGCCTAAMGCAGCCACCAATTAAGAAAGCGTTCM 

CCCACTACCTAAAAMTCCCAMCATATMCTGMCTCC^ 

TTAGTATMGTMCATGAAAACATTCTCCTCCGCATM^^ 

TCTACMTCMCCMCMGTCAmTTACOT 
45 AGGMCTCGGCAAATCTTACCCCGCCTGTTTACCAAAMCATCACCTCTAGCATC^ 

GACACATGTTTMCGGCCGCGGTACCCTMCCGTGCA 

CTCCACGAGGGTTCAGCTGTCTCTTACTTTTMCCAGTGAMTTGACCTGCCC 

GAAGACCCTATGGAGCTTTAATTTATTMTGCAMCAGTACCT 

ATTTCGGTTGGGGCGACCTCGGAGCAGMCCCAACCTCXGAGCAGTACATGCTM 
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CTCAATTGATCCAATMCTTGACCAACGGAACMGTTACCC 

TAGGGTTTACGACCTCGATGTTGGATCAGGACATCCCMTGGTGCAGCGGCTATTAMGGTTCGTTTC 

CTACGTGATCTGAGTTCAGACCGGAGTMTCCAGGTCGGTTTCTATCTACTTCAMTTCCTCCCTGTACGAMGGACMGAGM 

ATAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

5 214 

GTGTTTTTGGTGTCGTGTGATTATGTGATTGTTM 
TMTGTGTCCTTTGMGTC^ 

TATAGGTATOCTGATCGMGGAMGTTCAGTGMGATGCTGGTAGmACAGCACTCCGGAGMTTAAMC 
AGGTMCAGCTCTAAAMTTCAGACATTTGCCTTATTM^ 
10 GCACCCATCCTAGGAGMTTATCTGTGCAGMGTTCTGATTTACAGATTTGTGATCTTAGGGM 
AAATTTC 

216 

GMCACCCCGCTCATCTGAGAMGGGCAGCATOCTCCCTCAGCAMTCAGGGGGAGAGGGAGCTTTCCCAG^ 
TCTCTGATTCAGCGCCCAAAMCCCAGGGACAGGMGCCCAGCAGATTTTGCTC 
15 AMCACATCTCAGCTAGTTACCACCTCATTCATCTTTCCAGAGTGGTTA 
TGCAAATACAAMTCCCTGAATCTTAAAATAAATTACAGTTCTCTm 

217 

MSSHLVEPPPPLHNNNNNCEENEQSLPPPAGLN^ 

SSSRGSSHCDSPSPQEDGQIMFDVEMHTSRDHSSQSEEEVVEGEKEVEALKKSADWVSDW 
20 MRKSGAMKKGGI F SAEFLKVF I PSLFLSHVLALGLG I YIGKRLSTP SASTY 

218 

GCGGCGGACTCGGCTTGTTGTGTTGCTGCCTGAGTGCCGGAGACG 

GTCGTCCCACCTAGTCGAGCCGCCGCCGCCCCTGCACMCMCMCMCMCTGCGAGGAAMTGAGCAGTCTCTGCCCCCGC 
GGCCGGCCTCMCAGTTCCTGGGTGGAGCTACCCATGAACAGCAGCMTGGCMTGATMTGGCMTGGGAAAM 
25 GGMCACGTACCATCCTCATCCTCCATCCACMTGGAGACATGGAGMGATTCTTTTGGATGCACM 

TAGTTCCAGAGGCAGTTCTCACTGTGACAGCCCTTCGCCACMGMGATGGGCAGATCATGTTTGATGTGG 

CAGGGACCATAGCTCTCAGTCAGMGAAGAAGTTGTAGMGGAGAGMGGAAGTCGAGGCTTTGM 

ATCAGACTGGTCCAGTAGACCCGAAMCATTCCACCCMGGAGTTCCACTT^ 

GAGGAAMGTGGAGCCATGMGAMGGGGGTATTTTCTCCGCAGMTTTCTGMGGTGTTCATTCCATCTCTCTTCCTT^ 
30 TGTTTTGGCTTTGGGGCTAGGCATCTATATTGGAMGCGACT 

CCCTGGAMTGCGTGTGACCTGTGAAGTGGTGTATTGTCACAGTAGCTTATTTGM 

CTACCACCCTGTTTTTACATATCCMTTCCAGTMCCCTCAMTTC 

CTTATACCCTTTTTGGCCTGMGACATTTTAGMTTTCCT^ 

CGCAAATGCCACCAGCAGATTATMTTTTGTCGGCMTGCTATTATCTCTAATTAGTGCCACCAGAC 
35 TGGTATAMTTTTACTCTTCCMCATMCTACCATCTCTCTCTTAAMCGAGATCAOT 
ATTGTCTAGTTGTTTTTTTTCCCCCMGACAAAGGCAAGTT^ 

AAAAAAAMGGCAAGGCACMCAAAAAMTATCCTGGGCMTAAAAAAMTATTTTAMCCAAAA 
219 

^PSSKKLTGRLMLAVG&AVLGSLQFGY 
40 LFVNRFGRRNSMLMMNLLAWSAVLMGFSKLGKSFE^ 

LIAQVFGLDS IMGNKDLWPLLLSI I FI PALLQCIVLPFC PESPRFLLINRNEENRAKSVLKKLRGTADVTHDLQEMKEESRQMM 
REKKVTILELFRSPAYRQPILIAWLQLSQQLSGINAVFYYSTSIFEKAGVQQPVYATIGSGIVOTAPTW 
HLIGIAGMAGCAILMTIALALLEQLPWSYLSIVAI 
. CFQWEQLCGPYWIIFTVLLVLFFIFTYFKVPETKGRTFDEIASGFRQGGASQSDKTPEELFHPLGADSQV 

45 220 
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. TAGTCGCGGGTCCCCGAGTGAGCACGCCAGGGAGCAGGAGACCAMCGACGGGGGTCGGAGTCAGAGTCGCAGTGGGAGTCCCC 

GGACCGGAGCACGAGCCTGAGCGGGAGAGCGCCGCTCGCACGCCCGTCGCCACCCGCGTACCCGGCGCAGCCAGAGCCACCAGC 

GCAGCGCTGCCATGGAGCCCAGCAGCMGAAGCTGACGGGTCGCCTCATGCTGGC^^ 

AGTTTGGCTACMCACTGGAGTCATCMTGCCCCCCAGMGGTGATCGAGGAGT^ 
5 GGGAGAGCATCCTGCCCACCACGCTCACCACGCTCTGGTCCCTCTCAGTGGCCATCTTTTCTGTTGGGGGC 

TCTCTGTGGGCCTTTTCGTTAACCGCTT^ 

TCATGGGCTTCTCGAMCTGGGCMGTCCTTTGAG 

CAGGCTTCGTGCCCATGTATGTGGGTGAAGTGTCACCCACAGCCTTO 

TCGTCGGCATCCTCATCGCCCAGGTGTTCGGCCTGGACTCCATCAT^^ 
10 TCTTCATCCCGGCCCTGCTGCAGTGCATCGTGCTGCCCTTCTGCCCCGAGAGTCCCCGCTTC 

AGMCCGGGCCMGAGTGTGCTAMGMGCTGCGCGGGACAGCTGACGTGACCCATGACCTGCAGGAGATGM 

GGCAGATGATGCGGGAGAAGMGGTCACCATCCTGGAGCTCTTCCGCTCCCCCGCCTACCGCCAGCCCATCCTCATCGCTGTGG 

TGCTGCAGCTGTCCCAGCAGCTGTCTGGCATCMCGCTGTCTTCTATTACTCCACGAGCATCTTCGAGAAGGCGGGGGTGCAGC 

AGCCTGTGTATGCCACCATTGGCTCCGGTATCGTC^ ' 
15 GGCGGACCCTGCACCTCATAGGCCTCGCTGGCATGGCGGGTTGTGCCATACTCATC 

TACCCTGGATGTCCTATCTGAGCATOGTGGCCATCTTTGGCTTTGTGGC 

TCATCGTGGCTGMCTCTTCAGCCAGGGTCCACGTCCAGCTGCCATTC 

TTGTGGGCATGTGCTTCCAGTATGTGGAGC^ 

TCATCTTCACCTACTTCAAAGTTCCTGAGACTAMGGCCGGACC 
20 GCCAMGTGATMGACACCCGAGGAGCTGTTCCATCCCCTGGGGGCTGATTCCCMGTGTGAGTCGCCCCAGATCACCAGCCC^ ' 

GCCTGCTCCCAGCAGCCCTAAGGATCTCTCAGGAGCACAGGCAGCTGGATGAGACTTCCAAACCTGACAGATGTCAGCCGAGCC 

GGGCCTGGGGCTCCTTTCTCCAGCCAGCMTGATGTCCAGM^ 

AGACTGTTGCTCAAATCTATTCAGACMGCMCAGGTTTTATMTTOT 

GTCTCCTGTGCCCACATCCCAGGCTTCACCCTGMTGGTTCCATC^ 
25 TCTTCACCCAGCTMTCTGTAGGGCTGGACCTATGTCCTMGGACACACTMTCGMCTATGMCTACAMGCTTCTATCCCAG 

GAGGTGGCTATGGCCACCCGTTCTGCTGGCCTGGATCTC 

TCTTCCTACCCMCCACTCAAATTMTCTTTCTTTACC^ ■ 
CCAGTCTGGGCTGCCGGGTTCTAGTCTCCTTTGCACTGAGGGCCACACTATTACCATGAGMGAGGGCCTGTGGGA 
ACTCACTGCTCMGAAGACATGGAGACTCCTCCCCTC^ 
30 TAAATACAGACACTAAGTTATAGTATATCTGGACM^ 

' TATAMTGGCl'GGTTTTTAGAMCATGGTTTTGAMTGCTTGTGG 
MGTGGGGTTGCAACCACTGCMCGGCTTAGACTTCGACTCAGGATCCAGTC 
CTCAAAMTCTGTTTGATCCCTGTTACCCAGAGMTATATACATTCTTTATCTTC 
ATAGTTGGTGTTCAAAAAAACACTAGTTTTGTGCCAGCCGTGATGCTCAGGCTTGAMTCGCAT 

35 221 

MFIMGLGDPIPEELYEMLSDHSIRSFDDLQR^ 

RTEVFE I SRRLI DRTNANFLVWPPCVEVQRC SGCCKfNRWQCRPTQVQLRPVQVRKI EIVRKKPI FKKATVTLEDHLACKCETV 
AAARPVTRSPGGSQEQRAKTPQTRVTIRTVRVRRPPKGKHRKFKHTHDKTALKETLGA 

222 

40 GTGTGTGTGTGTGTGCGCGCGCATCTGAGAGAGAGAGAGAGAGA 

GGCCTCGGGGACCCCATTCCCGAGGAGCTTTATGAGATGCTGAGTGAC^ 
• CTGCACGGAGA(XCCGGAGAGGMGATCX3GGCCGAGTTGGACCTGAA(^TGA - 
TTGGCTCGTGGMGMGGAGCCTGGGTTCCCTGACCATTC^ 

TTCGAGATCTCCCGGCGCCTCATAGACCGCACCMCGCCAACTTCCTGGTGTGGCCGCCC^ 
45 GGCTGCTGCMCMCCGCMCGTGCAGTGCCGCCCCACCCAGGTGCAGCTGCGACCTGTCCAGGTGAGAM 
rGGAAGMGCCMTCTTTMGMGGCCACGGTGACGCTGGMGACCACCTGGCATGCMGTGTGAG 

CCTGTGACCCGMGCCCGGGC^TTCCCAGGAGCAGCGAGCCAAMCGCCCCAMCTCGGGTGACCATTCGGACGGTGCGAGTC 

CGCCGGCCCCCCAAGGGCAAGCACCGGAMTTCMGCACACGCATC^^ 

CATCGGCAGGAGAGTGTGTGGGCAGGGTTATTTMTATGGTATTTGCTGTA 
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gtttccctcgtccgtctgtctcgatgcctgattcggacggccmtg^ 

catcagcgggtctcctcccagcggcctccggtcttgcccagcagctcam^ 

tcttcccttmctccaagmcttgggatmgagtgtgagagagactgatggggtcgctcttt 

tgcacctggcctgggccacacctgagcgctgtggactgtcctgaggagccctgaggacctctc 
5 gaacccctggccagctctgaggggaggcaccrrccaggcaggca 

acagactggagaamcccctcccacggtgcccamcaccagtcacctcgtc 

cgttttcgttttgmgacgtggactcctcttggtgggtc 

gatggagtttgctgttgaggtggtgtagatggtgacc^ 

tgattcacctcttcctctggttcctttcatctc 
10 ccmggggctcttgmccccttattmggcccca^ 

caagggacctgctcatcatattccmcccagccacgactgccatgt^^ 

tgcagggagcactgttcacatcatagatamgctgatttgtatatttamtc^ 

mggatccttttcctmttcacacamgactc^ 
' ggagggagactgtggtaggggcagggaggcaacactgctctccacatgacctccatttc 
15 cccttccaggtgggtgtgggacacctgggagmggtctccmgggagggtgcagccctcttc 

cacttccccatctttgatccttctc^^ 

gaagggaamgatccccmgaccccctggggtgggatctgagctcccacctcccttcccacctactgcacm 
ccttccaamcctgcttccttcagtttgtamgtcggtgattatatttttgggggc 

TATTTATATTCCGTATTTAAAGTTGT 
20 '223 

MATLKIX>LIYNLLKEEQTPQNKITWGVGAVGM^ 
VTMSKLVI ITAGARQQEGESRLNLVQRNVN^^ 
RFRYLMGERLGVHPLSCHGWVLGEHGDSSVPWSGMIWAGV^ 

GLSVADLAESIMKNLRRVHPVSTMIKGLYGIKDDVFLSVPCILGQNGISDLVKVTLTSEEEARLKKSAOT • 
25 224 

TGCTGCAGCCGCTGCCGCCGATTCCGGATCTCATTGCCACGCGCCCCCGACGACCGCCCGACGTGCATTCCCGATTCCTTTTGG 
TTCCMGTCCAATATGGCAACTCTAMGGATCAGCTGATTTATMTCTTCTAAAGGMGMCAGACCCCCCAGAATAAGATTAC 
AGTTGTTGGGGTTGGTGCTGTTGGCATGGCCTGTGCCATCAGTATCT^ 
TGTCATCGMGACAAATTGMGGGAGAGATGATGGATCTO 
30 CAMGACTATMTGTAACTGCAMCTCCMGCTGGTCATTATCACGGCTC 
TTTGGTCCAGCGTMCGTGMCATATTTAMTTCATC 
TTCAAATCCAGTGGATATCTTGACCTACGTGGCTTG^ 

TCTGGATTCAGCCCGATTCCGTTACCTGATGGGGGAMGGCTGGGAGTTCACCCATTM 

ACATGGAGATTCCAGTGTGCCTGTATGGAGTGGMTGMTGT 
35 TGATAMGATMGGMCAGTGGAMGAGGTTCACAAGCAGGTGGTTGA 

ATCCTGGGCTATTGGACTCTCTGTAGCAGATTTGGCAGAGAGTATMTGMG 

GATTMGGGTCTTTACGGAATAMGGATGATGTCTTCCTTAGTGTTCCTTGCATO 

GMGGTGACTCTGACTTCTGAGGMGAGGCCCGTTTGM 

TTAAAGTCTTCTGATGTCATATCATTTCACTGTCTAGGCTACMCAG 
40 CTGATCTGTGATTAMGCAGTMTATTTTMGATGGACTGGGAAAMCATCM 

AMTCCACAGCTATATCCTGATGCTGGATGGTATTMTCTTGTGTAGTCTTCM 

TGACGCACCACTGCCAATGCTGTACGTACTGCATTTGCCCC 

CTCCTTCACTGMCATGCCTAGTCCMCATTTTTTCCCAGTGAGTCACATCCTO 

TTGTGCATMTTCTTCCAMGGATCTTATTTTGTGMCTATATCAGTAGTG 
45 CAMCMTGCMCCAACTATCCAAGTGTTATACCAACTAAAACCCCCMTAMCCTTGAACAGTG 

225 

METPAWPRVPRPETAVARTLLLGWVFAQVAGASGT^^ 

TDTECDLTDEIVKDVKQmARVFSYPAGNVESTGSAGEPLY^SPEFTPYLETNLGQ 
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NNTFLSLRDVFGKDLIYTLYYWKSSSSGKKTAKTNTNEFLIDVDKGENYCFSVQAVIPSRTVNRKSTDSP 
FYI IGAWFWI ILVI ILAISLHKCRKAGVGQSWKENSPLNVS 

226 

MGACTGCGAGCTCCCCGCACCCCCTCGCACTCCCTCTGGCCGGCCCAGGGCGCOT 
5 GCCACGGMCCCGCTCGATCTCGCCGCCMCTGGTAGACATO 

GTCGCTCGGACGCTCCTGCTCGGCIX^TCTTCGCCCAGGTGGCCGGCGCTTCAG 
TTMCTTGGAMTCMCTMTTTCMGACMTTTTGGAGTGGGMCCCAM 
ACTAAGTCAGGAGATTGGAAMGCAMTGCTTTTACACMCAGACACAGAGTGTGACCTCACC 
MGCAGACGTACTTGGCACGGGTCTTCTCCTACCCGGCAGGGMTGTGGAGAGCACCGGTTCTGCTGGGGAGCCTC 
10 AACTCCCCAGAGTTCACACCTTACCTGGAGACAAACCTCGGACAGCCM 

MTGTGACCGTAGMGATGAACGGACTTTAGTCAGMGGMCMCACTTTCCTMGCCTCCGGGA 
ATTTATACACTTTATTATTGGAMTCTTCMGTTC 

GATAMGGAGAAMCTACTGTTTCAGTGTTCMGCAGTGATTCCCTCCCGAACAGT^ 

GAGTGTATGGGCCAGGAGAAAGGGGMTTCAGAGAAATATTCTACATC^ 
15 ATCATCCTGGCTATATCTCTACACMGTGTAGAMGGCAGGAGTGGGGCAGAGCTGGM • 

TAAAGGMGCACTGTTGGAGCTACTCCAMTGCTATATTGCACTGTGACCGAGMCTTTO 

CAMTGAGTATTTCGGAGCATGMGACCCTGGAGTTCAAAAMCTCTTC^ 

GACATCAGCATTAGTCACTTTGAMTGTAACGMTGGTACTACMCCMTTCCMG 

TGCACATMCATGCTTTAGATTATATATTCCGCACTT^ 
20 AMAMTCCTGGGTGGACTTTTGAAMGCTTTTTTTTTTTTTTTTTT 

GCAGTAGCACGATCTCGGCTCACTTGCACCCTCCGTCTCTCGGGTTCMGCMTTGTCTGC 

TTACAGGTGCGCACTACCACGCCMGCTAATTTTTGTATTTTm 

GAATTCCTGACCTCAGTGATCCACCCACCTTGGCCTCCCAMGATGCTAGTATTATGGGCGTGMC 
AGCTTTTGAGGGGCTGACTTCMTCCATGTAGGAMGTAA^ 
25 GTCTATMTATAGTGTTTAGGTTCTTTTTTTTTTC 

TGTTAAGTGCAGGAGACATTGGTATTCTGGGCAGCTO 

MCATTTGAGAGCTMCTATATTTTTATMGACTACTATACAMCTACAGAGTTTATGATTT 

TTGACATTGTATATATMTTTTTTAAAMGGTTTTTOT 

ATATTGAGATAATTTATTTMTATACTTTAMTAMGGTGACTGGGAATTGTT 

30 227 

- MFLLSWVHWSLALLLYLHHAKWSQAAPMAEGGGQNHHEVVKFMDW 
GCCNDEGLECVPTEESNITMQIMRIKPHQGQHIGEMSFLQHNKCECRPKKDRARQENPCGPCSERRKHLFV 
DSRCKARQLELNERTCRCDKPRR 

228 

35 tcgggcctccgaaaccatgaactttctgctc^ 

gtcccaggctgcacccatggcagaaggaggagggcagmtcatcacgmgtggtgmgttc 

ctgccatccaatcgagaccctggtggacatcttccaggagtaccc 

cctgatgcgatgcgggggctgctgcmtgacgagggcctggagtgtgtgcccactgaggagtccm 

gcggatcamcctcaccaaggccagcacataggagag 
40 tagagcaagacaagaamtccctgtgggccttgctc^ 

ttcctgcaaaaacacagactcgcx5ttgcmggcgaggcagcttgagtta 

GTGAGCCGGGCAGGAGGMGGAC^CTCCCTCAGCGTTTCGGGAACCAGATCTCTCACCAGG 
229 

MSREMQDVDLAEVKPLVEKGETITGLLQEFDVQEQDIETLHGSVHOTLCGTPKGNRPVILTYH 
45 I TQHFAVCHVDAPGQQDGAASFPAGYMYPSMDQLAEMLPGVLQQFGLKS I IGMGTGAGAYTLTRFALNNPEMVEGLVLINVNPC 
AEGWMDWMSKISGWTQALPDMWSHL^ 

CPALLWGDSSPAVDAVVECNSKLDPTKTTLLKMADCGGLPQISQPAKLAEAFKYPV 
LDGTRSRSHTSEGTRSRSHTSEGTRSRSHTSEGAHLDITPNSGAAGNSAGPKSMEVSC 
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230 

tgmgctcgtcagttcaccatccgccc^ 
(x5cgttaggcaggtgacagcagggacatgtctcgggagatgcaggatc 

gggagaccatcaccggcctcctgcmgagtttgatgtccaggagcaggacatcgagactttacatggctctg™ 
5 tgtgtgggactcccmgggamccggcctgtcatcctcacctaccatgacatcggcatgmccacaam 

tcttcmctacgaggacatgcaggagatcacccagcactttgccgtctgccacgtcx3 

cctccttccccgcagggtacatgtacccctccatggatcagctggctgamtgcttcct^ 

amgcattattggcatgggmcaggagcaggcgcctacaccctmctcgatttgctct 

ttgtccttatcmcgtgmcccttgtgcggaagggtggatggactgggccgcctccaagatctc 
10 cggacatggtggtgtcccacctttttgggmggm^ 

tgmtgacatgaaccccggcaacctgcacctgttcatcmtgcctacmcagcc^ 

cgggmcccacacagtcaccctgcagtgccctgctctgttggtggttggggacagctc 

gcmctcaaaattggacccmcamgaccactctcctcmgatggcggactgt^ 

agctcgctgaggccttcmgtacttcgtgcagggcatgggatacatgccctcggctagcatgacccgcctg 
15 cagcctctggttccagcgtcacttctctggatggcacccgcagccgctcccacaccagcgagggcacccgaagccgctcccaca 

ccagcgagggcacccgcagccgctcgcacaccagcgagggggcccacctggacatcacccccaactcgggtgctgct^ 

gcgccgggcccmgtccatggaggtctcctgctaggcggcctgcccagctgccgcccccggactctgatctctc 

ctcctccccggccccttttcgccccctgcctgccatactgcgcctmctcggtattmtccamgcttattttc 

ctctggtggagacamtgaggtctattacgtgggtgccctctc 
20 tctccgcatcgcatcctcttccattmccagtggccggtoccactctcc^ 

mgacgcgtagcagcacacacttcacamgccmgcctaggccgccctgagcatcctggttcamcgggtgcctggtc 

ccagccgcccacttcccgtttcctctttmctgaggagaagctgatccagctttc 

• gaggggggtaatagtgacatgcaggcacctctttt^ 

gaggcatttggmcmcamtctacgtagttmcttgmgamccgatttttam^ 
25 gmgcamcaagcttgatttttctagcatcctcttmtc 

atatttcagcaaacgttgggcatcatggtttttgmggctttagtt^ 

ccctgatacatgagccagtgattattcttgttcagg 

attccacagctgccctggctgtgatgagtgtccttgcaggggccggagtaggagcac 

cgatgtmgggattccttgttgttgtgttc^^ 
30 gagattggcttamtccagttttcaatcttcgacagctgggctggaacgtgmctcagtagctc 

gttcttggatcctcagaactctttgctcttgtcggggtc^ 

attctggmtacatattccatggactttccttccctctcctgcttcctctttt 

gacaaacactgacgtttctgggtggccagtgcggctgccaggttcctgtactac 

ggattttctcataggaagtttggtcagagtgmttgmtattgtm 
35 gttgggaggaggmgtagtccagccttccaggtgggcgtgagaggcmtgactcgttacctgccgcccatc 

tccctggccttgagtagaamgtcggggatcggggcmgagaggctgagtac 

gaggagtttcttmtgagatatttgtatttatttccagaccmtamtttc 

231 

MIWYILIIGILLPQSIJMFGFFTSIGQOT^ 
40 FKLMKRLNTEWSELENLVLKDMSDGFI SNLTIQRPVLSNDEDQVGAAKALLRLQDTYNLDTDTI SKGNLPGVKHKSFLTAEDCF 

ELGKVAYTEADYYHTELWMEQALRQLDK^ 

AKEKDVNKSASDDQSDQKTTPKKKGVAVDYLPERQ 
. IIRFHDIISDAEIEIVKDLMPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPWSRINMRIQDLTC 

GVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGAT^ 
45 VLVGNKWVSNKWLHERGQEFRRPCTLSELE 

232 

GAGCGGGCTGAGGGTAGGMGTAGCCGCTCCGAGTGGAGGCGACTGGGGGCTGMGAGCGCGCCGCCCTCTC • 
AGGTGTGTGATCCTGTAAMTTAAATCTTCCMGATGATCTGGTATATATTM 
TCATCCAGGCTTTTTIACTTCMTTGG 
50 TATTMGGCAGAAGAGGACAAGTTAGMCAMTAAAAAMTGGGCAGAGMGTTAGATCGGCTAACTAGTACAGCGACAAMG^ 
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TCCAGMGGATTTGTTGGGCATCCAGTAAATGCATTCAM 
GGTCCTTMGGATATGTCAGATGGCTTTATCTCTMCCTMCCATTC 

GGCAGCCAMGCTCTGTTACGTCTCCAGGATACCTACAATTTGGATACAGATACCATCTCAMGGGTMTCTTC 

ACACAMTCTTTTCTMCGGCTGAGGACTGCTTTGAGTTGGGCAMGTGGCCTATACAGM 
5 GTGGATGGMCMGCCCTMGGCMCTGGATGMGGCGAGATTTCTACCATAGA 

TGCGGTATATCAGCAGGGAGACCTGGATMGGCACTTTTGCTCACAAAGMGCTTCTTGMCTAGATC 

TAATGGTAACTTAAAATATTITCAGTATATAATGGCTAMGAAAMG 

GAAAACTACACCAMGAAAAMGGGGTTGCTGTGGATTACCTGCCAGAGAGACAGMG 

TATCAAMTGACCCCTCGGAGACAGAAAAMCTCTTTTGCCGCTACCATGATGGAMCCG 
10 AGCTAMCAGGAGGATGMTGGGACAAGCCTCGTATTATTC 

AGACCTAGCAAAACCMGGCTGAGCCGAGCTACAGTACATGACCCTGAGACTGGAAMTTC 

TAAGAGTGCCTGGCTCTCTGGCTATGAAMTCCTGTGGTGTCTCGAATTMTATGAGMTACMGATCTMCA 

TTCCACAGCAGAGGAATTACAGGTAGCAMTTATGGAGTTGGAGGACAGTATGM 

GCCAGATGCTTTCAMGAGCTGGGGACAGGAMTAGMTTGCTACATGGCTGTTTTA 
15 CACTGTTTTTCCTGMGTTGGAGCTAGTGTTTGGCCCAAAAMGGMCTGCTGTTTTCTGGTA 

AGGAGATTATAGTACACGGCATGCAGCCTGTCCAGTGCTAGTTGGCMCAAATGGGT^ 

ACAAGMTTTCGMGACCTTGTACGTTGTCAGMTTGGMTGACAMCAGGCOT 

TGTCTGATATACACATTTCCATAGTCTTMCTTO^ 

CATCCCATGTTTCATCTGTGGACMTTCCTTAC 
20 - TAMGTTCTGTTGGTATCACAGMGACMGGCAGAGTTTAMGTGAGGAATTmTATTTAM 

ACATMTTTGAGCATCCAGTTTTAGTATTTCACTACATCTCAGTTO 

AMTGCCTTACAGATGTGCCTAGGTGTTCTGTTTACCTAGTGTCTTACTCTGTTTTCTGGATC 

GACACTMCTGGGTTCCATGTGATTGCCCTTTCATATGATCTTCTMGTTGATTT^ 

TATACTGTATTTTACCMCCCCCTCTCTTTTC 
25 TTTTTTTTTTTTTTMTGGAMGTCCTGCATMCM 

TCCCTTTTTTAAMTGACAGATGATTTTGTTCAGGAA 

CAGCAGGGTGGTMTATTGGCATTCTGATTAMTACTGTGCCTTAGGA 

ATGAGGGMTTGATTTTTTTTAAMGTCTTTTTCTTAGAMGCCAAMTGTTTC 

AACAATGACCTATTTATGATCTTAAATCT'TTTTT 

30 233 

MSQNGAPGMQEESLQGSWELHFSNNGNGGSVPAS 
HSIGEKNSSQSEEDDIERRKEVESILKKNSDWITOWSSRP^^^ 
LLLSHLLA IGLG I Y IGRRLTTSTSTF 

234 

35 CCTCCGCTCAGTCCGGGAGCGCACGTGGGCCGCGGCGCTCCGACCTCCGCTTTCCCACCGCCCGCAGCTGAAGCAC 
GCCCGGCGCGGACTCCGATCGCCGCAGTTGCCCTCTGGCGCCATGTCGCA 

CAGGGCTCCTGGGTAGAACTGCACTTCAGCMTMTGGGMCGGGGGCAGCGTTCCAGCCTCGGTTTCTATTTATM 
ATGGAAAAAATACTGCTGGACGCACAGCATGAGTCTGGACGGAGTAGCTCCAAGAGCTCTCACTGTGACAGCCCACCTCGCTCG 
CAGACACCACMGATACCMCAGGGCTTCTGAMCAGATACCCATAGCATTGGAGAGAAAAACAGCTCACAGTCTGAGGAAGAT 
■ 40 GATATTGAMGMGGAAAGAAGTTGAMGCATCTTGMGAAAM 

ATTCCCCCCMGGAGTTCCTCTTTAMCACCCGMGCGCACGGCCACCCTCAGCATGAGGAACACGAGCGTCATGM 
GGCATATTCTCTGCAGMTTTCTGAAAGTTTTCCTTCCATCTC 

ATTGGMGGCGTCTGACMCCTCCACCAGCACCTTTTGATGAAGMCTGGAGTCTGACT^ 
GCTTCKMCATAGCTCACTGMGAGCTGTTAGATCCT^ 
45 CCTMTTTAGTAAMTAAMGAATAGACACTAAAATCATGTTGATC 
GATTMTGTCTACTGTGAAAATTTGGTAGTAMTTTTCATTTC 
CTAGTCATGATGTGTGTTGTATTTTAAAMTTATCTGCMCCTTMTTCAGC 
CATTGATMTAAMTCGCTACTOTMGGGGTTTGTCC 

ATTTMTTTAMTGGATGCAGGTTGTCTACTAMGAMGATTATATATAACTATG 
50 ATAGAGCTACAMCTCAGCTGTACAGTTCGTACACTAMCTCTTCTTGCTTTTGCATTATMG^ 
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GGTGATCACCCTGGATGATCAGTTTTCTGCTGMGGCACCTACTCAGTATCTTTTC 

AATCCTCTCCTTTGTGTTCMCTTTTC 

AAATGGAAAAAAAAATAAATCTT 

5 235 

MQMSPALTCLVIGIALVFGEGSAVHHPPSWAH^ 
MGFKIDDKGMAPALRHLYKELMGPVWKDEISTTm 
GMISNLLGKGAVDQLTRLVLWALYFNGQW^ 
. GDTLSi4FIMPYEKEVPLSALTNILSAQLISHWKG2^TRLPRLLVLPKFSLETEVDLRKPLENLGM 
10 PLHVAQALQKVKIEVNESGWASSSTAVIVSARMAPEEIIMDRPFLFW 

236 

GMTTCCTGCAGCTCAGCAGCCGCCGCCAGAGCAGGACGMCCGCCM 
TCTCCAGCCCTCACCTGCCTAGTCCTGGGCCTGGCC^ 

GCCCACCTGGCCTCAGACTTCGGGGTGAGGGTGTTTCAGCAGGTGGCGCAGGCCTCCMGGA 
15 TATGGGGTGGCCTCGGTGTTGGCCATGCTCCAGCTGA 

MGATTGATGACAAGGGCATGGCCCCCGCCCTCCGGCATCTGT^ 

ACCACAGACGCGATCTTCGTCCAGCGGGATCTGMGCTGGTCCAGGGCTTCATGCCCCACTTCTTCA 

GTCMGCMGTGGACTTTTCAGAGGTGGAGAGAGCCAGATTCATCATC 

AGCMCTTGCTTGGGAMGGAGCCGTGGACCAGCTGACACGGCTGGTGCTG^ 
20 ACTCCCTTCCCCGACTCCAGCACCCACCGCCGCCTCTTCCACAMTCAGACGGCAGCACTGTCTCTGTGCCCA 

ACCMCMGTTCMCTATACTGAGTTCACCACGCCCGATGGCCATTACTACGACATCCTGGAACTGCCCTACCACGGGGACACC 

CTCAGCATGTTCATTGCTGCCCCTTATGAAAMGAGGTGCCTCTCTCTGCCCTCACCAACATTC 

CACTGGAMGGCMCATGACCAGGCTGCCCCGCCTCCTGGTTCTC 

CCCCTAGAGMCCTGGGMTGACCGACATGTTCAGACAGTTTCAGGCTGACTTCACGAGTC 
25 GTCGCGCAGGCGCTGCAGAAAGTGAAGATCGAGGTGMCGAGAGTGGCACGGTGGCCTC 

GCCCGCATGGCCCCCGAGGAGATCATCATGGACAGACCCTTCCTCTTTC 

ATGGGCCMGTGATGGMCCCTGACCCTGGGGAMGACGCCTTCATCTGGGACAAMCTGGAGATGCATC 
TCCGMGAAMGMTTTTAGTGTTMTGAC^^ 

TCTGTCTCCMGACCTTGGCCTCTCCTTGGAGGACCTTTAGGTCAMCTCCCTAGTCTCCACCTGAG 
30 TGAAGCACMCTCCCTTMGGTCTCCAMCCAGACGGTGACGCCTGCGGGACCATCTGGGGCACCTGCTTCCACCCGTCTCTCT 
GCCCACTCGGGTCTGCAGACCTGGTTCCCACTGAGGCCCTTTGCAGGATGGAACTACGGGGCTTACAGG 
GGTAGAMCTATTTCTGTTCCAGTCACATTGCCATCACTCTTGTACTGCCTGCCACCGC 
GCCAGTGGMGAMCACCOTTCATCTCAGAGTCCACTGTGGCACTGGCCACCCCTCCCCAGTACAGGGGTC 
GAGTGMTGTCCCCCATCATGTGGCCCMCTCTCCTGGCCTGGCCATCTCCCTCCCCAGAMCAGTC^ 
35 AGTGTAGGTGACTTCTTTACTCATTGMGCAGATTTCTGCTTCC 
TGCCCAGCTCTTCACCCCCCMTCTCTTGGTGG 

GAGAGAMGAGMCTACTMGGAAMTAATATTATTTAMCTCGCTCCTAGTC 

GMGTCCAGCCACTTGACTGGCACACACCCCTCCGGACATCCAGCG 

CCCTCGCGCCCCCCGCGCCCCCCGCGCCCCTCTT^ 
40 CAAMGGACGGAGTGGGGGGACAGAGACTCAGATGAGGACAGAGTGGTTTCCM 

AAGGGGCTGCATGACCTACCAGGACAGMCTTTCCCCMTTACAGGGTGACTCACAGCCGCATO 

CATTTCCGGCTGCTGTGTGTGAGCAGTGGACACGTGAGGGGGGGGTGGGTGAGAGAGACAGGCAGCTC 

GATAATATTTCTGAAMCCTACCAGCCAGAGGGTAGGGCACAMGATGGATGTMTGCAC 

TGCTTGAGCCCAGGAGTTCMGACCAGCCTGGGCMCATACCMGACCCCCGTCTOT 
45 TTAAATATATATTTCTMTATCTTTAMTATATATATATATTTTAMGA^ 

GAATGTAATCTAATAGAAGC 

237 

MLARALLLCAVLALSHTANPCCSHPCQNRGVCMSVG 
WNVVNNIPFLRNAIMSmTSRSHLIDSPPTYM^^ 
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LRRKFIPDPQGSNMMFAFFAQHFTHQFFKTDHKRGPAFTNGLGH^ 
KDTQAEMIYPPQVPEHLRFAVGQEVFGLVPGLMOTATIWLREHNRVCDVLKQEHPE 
HLSGYHFKLKFDPELIiFNKQFQYQNRIAAEFOTLYHWHPLLPDTFQIHDQKYNYQQFIYNNSILLEHGITQ 
AGGRNVPPAVQKVSQASIDQSRQMKYQSFNEYRKRFMLKPYESFEELTGEKEMSAELEALYGDIDAVELYPALLVEKPRPDAIF 
5 GETMVEVGAPFSLKGLMGNVICSPAYWKPSTFGGEVGFQH 
DDINPTVLLKERSTEL 

238 

CMTTGTCATACGACTTGCAGTGAGCGTCAGGAGCACGTCCAGGAACTCCTC^ 

CCCTCAGACAGCAAAGCCTACCCCCGCGCCGCGCCCTGCCCGCCGCTCGGATGCTCGCCCGCGCCCTGCTGCTGTG^ 
10 TGGCGCTCAGCCATACAGCAMTCCTTGCTGTTCCCACCCATGTCAAMCCGAGGTGTATGTATC 
ATMGTGCGATTGTACCCGGACAGGATTCTATGGAGAAMCTGCTCMCACCGGMTTTTO 
MCCCACTCCAMCACAGTGCACTACATACTTACCCA 

ATGCMTTATGAGTTATGTCTTGACATCCAGATCACATTTGATTGACAGTCCACCAAC™ 
"GCTGGGMGCCTTCTCTMCCTCTCCTATTATACTAGAG 
15 MC^TAAAMGCAGCTTCCTGATTCAMTGAGATTGTGGAAAMTTGCTTCTAAGMG 

CAMCATGATGTTTGCATTCTTTGCCCAGCACTTCACGCATCAGTTTTTCMGACAGATCATM 
ACGGGCTGGGCCATGGGGTGGACTTAMTCATATTTACGGTGAMCTCTGGCTAGACAGCG^ 

GAAAAATGAAATATCAGATAATTGATGGAGAGATGTATCCTCCCACAGTCAAAGATACTCAGGCAGAGATGATCTACCCTCCTC 
MGTCCCTGAGCATCTACGGTTTGCTGTGGGGCAGGAGGTCTTTGG 
20 TGCGGGMCACMCAGAGTATGCGATGTGCTTAAACAGGAGCATCCTGMTGGGGTGATGAGCAGTTGTTCCAGACAAGCAGGC 
TMTACTGATAGGAGAGACTATTMGATTGTGATTGMGATTATGTGCAACACTTGAGTC 
ACCCAGMCTACTTTTCMCAMCAATTCCAGTACCAAMTCGTATTGCTGCTGMTTTMCACC 
TTCTGCCTGACACCTTTO^TTCATGACCAGAAATACMCTATC 

GMTTACCCAGTTTGTTGMTCATTCACCAGGCAMTTGCTGGCAGGGTTGCTGGTGGTAGGM 
25 MGTATCACAGGCTTCCATOGACCAGAGCAGGCAGATGAMTACCAG 

CCTATGMTGATTTGMGMCTTACAGGAGAAAAGGAAATGTCTGCAGAGT^ 
AGCTGTATCCTGCCCTTCTGGTAGAAMGCCTCGGCCAGATGCCATCTTTG^ 
CCTTGAMGGACTTATGGGTMTGTTATATGTTCTC^ 

TCATCAACACTGCCTCAATTCAGTCTCTCATCTGCAATAACGTGAAGGGCTGTCCCTTTACTTCATTCAGTGTTCCAGATCCAG 
30 AGCTCATTAAMCAGTCACCATCMTGCMGTTCTTCCCGCTCCGGACTAGATC^ 

GTTCGACTGMCTGTAGMGTCTMTGATCATATTTATTTATTTATATGMCCATGTC 

tatattamctccttatgttacttmcatc 

GACTTTTATGTCACTACTCTAMGATTTTGCTGTTGCTGTTAAG 

AAATGAGTTTTGACGTCTTTTTAOT 
35 CACAAGATGGCAAAATGCTGAMGTTTTTACACTGTCGATGTTTCCMTG 

TGAMTTTTAMGTACTTTTGGTTAOT^ 

GACATTACCAGTMTTTCATGTCTACTTTTTAAMTCAG 

ACCTGTAAMGCTTGTTTGATTTCTTAMGTTATTAMCTTGTACA 

MTCAGATGAMTTTTACTACMTTGCTTGTTAAMTATTTTAT 
40 GTGTGACTGTTAAMCTTCCTTTTAMTCA 

■ MTATTTTGTTGAGATATTCCAGMTTTGTTTATATGG 

ATMGCMTAACAAAGMGAAMCCAMTTATTGTTCAMTTT^ 

ACTGCAGGCCTGGTACTCAGATTTTGCTATGAGGTTMTGMGTACCMGCTC 

TCTGTTGTACAGTTTMTTTAGCAGTCCATATCACATTGCAAMGTAGCMTGACCTCAT 
45 ATTCATTTCACACATTAATTTTATCTCAGTC 

TTCCTTTTCTTTTCTTCTTTTAGCCATTT^ 

TTTAAGATCAGAGTTCACTTTCTTTGGACTCTGCCTAT^ 

AGGACTGCTATTTAGCTCCTCTTMGMGATTA^ 

AAMGCAGAGMTTTTATTTATAGCTMTTTTAGCTATCTGTMCCMG 
50 GTACGGGGTTTGTGACTGGAAAMGTTACX5TTCCCATTCTMTTAATGCCCTT 

MGTAGTTCTCAGCMTAATAATMTGACGATMTACTTCTTTTCCACATCTCATTGTCACTGAC 
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TACTTMTTTATTGAAGATTATTATTTATCT 
TTTTTTGTTATGTCACAATCAGTATATT^ 

TMGATTTGTGMTAMTTTTTAGAMTCTGATTGGCATATTGAGATATTTMGG 
GTGCTAGCCCACAMGMTATTGTCTCATTAGCCTGMTGTG^ 
5 TGCTTCGTTMTTTGTTCAGCCACMTTTAT 

GCTGATTACAGATAATAGTATTTATATAMTAATO 
AMGATMCTCAGGAGMTCTTCTTTACMTTTTACGOT 
AAACACTGTTCACTGTTTTT^ 
GTGTATGCGMTGTTTCAGTGCCTC^ 
10 TTTGTACTATTTA 

239 

MDPNCSCEAGGSCACAGSCKCKKCKCTSCKKSCCSCCPLGCAKCAQGCICKGASEKCSCCA 

240 * * ' 

CTCCAGTCTCACCTCGGCTTGCMTGGACCCCMCTGC^ 
15 GCAAAMGTGCAMTGCACCTCCTGCMGMGAGCTGCTGCTCCTGTOCCCCCTC^ 
TCTGCAMGGGGCGTCAGAGAAGTGCAGCTGCTGTGCCTGATGTCGGGAC 
AAMTCCGAGGTTTTTTTTTTCTACMCTCCGAC 
CACTTAGACTTGAAAAAAAAAAAAAAAAAAA 

241 

20 MDPNCSCSPVGSCACAGSCKCKECKCTSCKKSCCSCCPVGCAKCAQGCICKGTSDKCSCCA 
242 

CCGCTGCGTGTTTTCCTCTTGATCGGGMCTCCTGCTTCTCCTTGCCTCGAMTGGACCCCMC 

CTCCTGTGCCTGTGCCGGCTCCTGCAMTGCAMGAGTGCAMTGCACCTCCTGCAAGMGAGCTGCTGCTCCTGC^ 

GGGCTGTGCMMGTGTGCCCAGGGCTGCATCTGCAMGGGACGTCAGACAAGTGCAGCTGCTGTG^ 

TCTTTTTTTCTATGAMTATGTGM 

243 

MDPNCSCAAGVSCTCASSCKCKECKCTSCKKSCCSCCPVGCAKCAQGCICKGASEKCSCCA 
244 

30 ACTCCGCCTTCCACGTGCACCCACTGCCTCTTCCCTTCTCGCTTGGGMCTCTAGTCTCGCCTCGGG 
TGCTCCTGTGCCGCTGGTGTCTCCTGCACCTC^ 
TGCTGCTCCTGCTGCCCTGTGGGCTGTGCCMGTGTGC 

GCCTGATGTCGGGACAGCCCTGCTCCCMGTACAMTAGAGTGACCCGTAAMTCTAGGATTTTT^ 
GACCCCTTTGCTACATTCCCTTTTTTCTGTGAAATATGTGMTMTM 

35 246 

tttamgtcaamttgtttotattgtcagtcacatatttagta^ 
aaamtccaggttgtgcaggttgttctatttacatctgggagmg 
atgcccctttgcagacgcagccctgggacacttggcacagccacggggagcaggmcagcagctcttc 
tcx:actctttgcacttgcaggagccggcgcacgtgcaggagcca 

40 247 ■ 

MGTQKVTPALIFAITVATIGSFQFGYNTGVINAPEKm^ 
VmFGRRNSt^IVlfoLAVTGGCFMGLCKVM 

aqifglefilgseelwpllix;ftilpailqsaalpfcpesprfllinrkeeenakqilqrlwgtqdvsqdi^ 
kqvwlelfrvs syrqpi 1 1 s ivlql sqqlsg inavfyystgifkdagvqep i yatigagwnti ftwslflveragrrtlhm 
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IGLQGMAFCSTLfflVSLLLKDNYNGMSFVCIGAILVFVAF 
PSAAHYLGAYVFIIFTGFLITFLAFT^^ 

248 

gtggggtggggtggggctgggggcitgtcgccctttcaggctccacccttt^ 
5 acccagagatgcctgtmtggtgagactttggatccttcctga 

gttttgttggctgaaamgctgtttc . 
agaaggtcaccccagctctgatatt^ 

atgctcctgagmgatcatamggmtttatcmtaamctttgacggacmggg 

cgtctctctggtccttgtctgtggccatam^ 
10 ttggcaggcgcaattcmtgctgattgtc 

cggttgaaatgctgatcctgggtcgcttggttattggcctcttc 

agatctcgcctactgccctgcggggtgcctttggcactctcaaccagc 

ttggtctggmttcatccttgggtctgmgagctatggccgctgct^ 

cagcccttccattttgccctgamgtcccagaot 
' 1 5. ggttgtggggcacccaggatgtatcccmgacatccaggagatgamgatgagagtgcmggatc * 

ccgtgctagagctctttagagtgtccagctaccgacagcccatcatcatttccattgtgctc 

ggatcaatgctgtgttctattactcmcaggmtct^ 

gtgtggttmtactatcttcactgtagtttcot 

gagggatggctttttgttccacgctcatgactgtttctttgttattam 
•20 gggctatcttk^tctttgtagccttctttgamt^ 
. gcccccgcccagctgcgatggcagtggccggctgctccmctggacctccmcttcctagtcgga™ 

ctcactatttaggagcctacgtttttattatcttcaccggcttcctca™ 

agacccgtggcaggacttttgaggatatcacacg^ 

tcatggagatgaacagcatcgagcctgctmggagaccaccaccmtgtctmgtcgtgcctccttccacctccctc 
25 gggaaagccacctctccctcmcaagggagagacctcatcaggatgmcccaggacgcto 

tctcatcccacgcactccatgagcaccccmggctgcggtttgttggatcttcmtggcttto 

cctcttctgcttaggagagaccgagtgmcctac 

tttcctcccttgggttctgatattgccgcactaggggatataggagaggaamgtm 

ccaggmgcagatacatatgagtgtggmgccggagggtc^ 
30 camttmcttgagttttatttattttatcctctggttt 

tgccgmtgtgcaggtttgttacataggtatatatatc 

taaamcagmggaaattgagattagagggaggtgtttamgagaggttatagagtagmgatttc 

tgcmtmgmtttagggagaaatgttgttcattattggagggtamtgatgtggtc 

mtttctgtccttcagatggamctctttmcttctcgtaamgtcat^ 
35 .ttttttctttaagataatagtttacatgtact 

gttgatgggttctaattttggatggagtccagggmgag 

tccttcttgtagtagtctcattacttt^ 

mttggttcttggtagagattcatotttttccactttgttcttt 

catacctttamggaattccccamgmtgtttatagcamcttg^ 
40 agcgattattatctamgtgtgttgttgctttaggctcacggcacgcttgcgtatgtc 

ccgmtgccctcaggggacttgmtctttccmtamccaggtttagacagtatgagtc 

gaggatgaatgtatgtgcactgtcactttgctctgggtcg 

gcccctttttcatatgttgctcagtctcccm 

gacagtattttggttcttagttctcactgttccctctgctcctggagcctttgmtaa 
45 ggtggctcacgcctgtmtcccagc^emgggaggcctaggcgggcggtcaggggttcgagacca 

MCCCTGTCTCTACTAAAMTGOVAAMTTAGCCGGGCGTGGTGGCGGGCGCCTGTM 
GGAGMTCATGTGAACCCGGGACGCAGGGGTTGCAGTGAGCGGAGATCGCATCATTGCACTCTA 
CTCCGTCTCAAAAAAAAAAAMTGCACATAGCTATCGAGTGTGC 
TCTGGCTCCTCAMCAGTAGGTTGGCAGTAAGGCAGGGTCCCATTO 
50 CTATTGTTACTCTGGTTCTTTGTTTTAAMTAAAAATTC 
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249 

MIASHLLAYFFTELNHDQVQK^ 
LGGTNFRVLVTOVTDNGLQKVEMENQIYAIPEDI^^ 
VSWTKGFKSSGVEGRDWALIRKAIQRRGDFDIDIVAWNDTTC^ 
5 GRMCIMEWGAFGDDGSLNDIRTEFDQEIDMGSLNK3KQL^ 
KDISDIEGEKDGIRKAREVLMRLGLDPTQEDCVA^ 

•PHFAKRLHKTVRRLVPGCDVRFLRSEDGSGKGAAMVTAVAYRLADQHRARQKTLEH^ 
ASAPVKMLPTWCATPKTEKGDFLALDLGGTNFRVLLVRVI^GKWGGVEMHMIYA 
MGMKGVSLPLGFTFSFPCQQNSLDESILLKWTKGFKASGC^ 
10 EVGLIVGTGSNACYMEEMRNVELVEGEEGRMCVNM^ 

ILiDmRGLLFRGRISERLKTRGIFETKFLSQIESITLALLQVRAILQHLGLESTCDDSIIVKEVCTWARRM 
VVDRIRENRGLDALKVWGVDGTLYKLHPHFAKVMHETVKDLAPKCDVSFLQSEIX5SGKGMLITAV 

250 

. • GTTGCATGAMCTCCGGCGCAGGAGTCCCGC&CTC 

•1 5 CACGCGCCTGTGMTCGGAGAGGTCCCACTGCCCGAGTGGAGCCGGGCTGAGATTCra 

GGCCGMGTTAGCGCCTTGACGTGGGACMCCGGACACGTCGCCAGGAGAGMCTGAGGCGCCTTCTAGC 
MTCACGTCTCCGGAGACCCGCGCCCTCCGCCAGCCGGGCGCACCCTCGCCGGTAGCCTTCTTOTGCGCCGTCCGGACTCCCA 
GCTCCCGGCCCGGCAGCCGAGCCCCAGCACAMGCAGTCGGACCGCGCCGCCCGCCTCCCCTCTCGCGTCTCCGCCTCGGTTTC 
CCMCTCTGCGCCGTCGGGCCGCGGCAGGATGATTGCCTCGCATCTGCTT^ 

20 TGCAGMGGTTGACCAGTATCTCTACCACATGCGCCTCTCTGATGAGACCCTCTTGGAGATCTCTAAGCGGTTCCGCAAGGAGA 
TGGAGAMGGGCTTGGAGCCACCACTCACCCTACTGCAGCAGTGMGATGCTGCCCACCTTTGTGAGGTCCACTC 
CAGMCACGGAGAGTTCCTGGCTCTGGATCTTGGAGGGACCMCTTCCGTGTGCTTTGGGTGAAAGTMCGGACAATGGGCTCC 
AGAAGGTGGAGATGGAGMTCAGATCTATGCCATCCCTGAGGACATCATGCGAGGCAGTGGCAC 

CCGMTGCCTGGCTAACTTCATGGATMGCTACAMTCAMGACMGMGCTCCCACTGGGTTTTACCTTCTCGTTCCCCTGCC 

25 accagactamctagacgagagttocctggtctcatggaccmgggaot^ ■ 
ctctgatccggaaggccatccagaggagaggggactttgatatcgacattgtggctgtggtc 
tgacctgtggttatgatgaccacmctgtgagattggot 
gccacatcgacatggtggmggcgatgaggggcggatgtgtatcm^ 
acgacattcgcactgagtttgaccaggagattgacatgggctcactgmcccgggamgcm 

30 ggatgtacatgggggagctggtgaggcttatcctggtgmgatggccmggaggagctgctcttt^^ 
agcttctcmcaccggtcgctttgagaccaaagacatctcagacattgmggggagmggatc 
tcctgatgcggttgggcctggacccgactcaggaggactgcgtggccactcaccggatct 
ccagcctctgcgcagccaccctggccgccgtgctgcagcgcatcmggagmcamggcgaggagcg 
gggtcgacggttccgtctacmgamcacccccattttgccaagcgtctacatmgaccgtgcggcggctggtc 

35 atgtccgcttcctccgctccgaggatggcagtggcamggtgcagccatggtgacagcagtggcttaccggctggccgatcaac 
accgtgcccgccagmgacattagagcatctgcagctgagccatgaccagctgctggaggtcaagaggaggatgaaggtagaaa 
tggagcgaggtctgagcmggagactcatgccagtgcccccgtcmgatgc 
cagagamggggacttcttggccttggaccttggaggmcamtttccgggtcctgctggtccgtgttcgg 
gtggagtggagatgcacmcmgat(^acgccatcccgcaggaggtcatgcacggcaccggggacgagctctttgaccacattc 

40 TCCAGTGCATCGCGGACTTCCTCGAGTACATGGGCATGMGGGCGTGTCCCTGCC 

AGCAGMCAGCCTGGACGAGAGCATCCTCCTCMGTGGACAAMGGCTTCMGGCATCTGGCTGCGAGGGCGAGGAC 

CCCTGCTGAAGGMGCGATCCACCGGCGAGAGGAGTTTGACCTGGATGTGGCTGCTGTGGTGM 
■ TGACCTGTGGCTTTGMGACCCTGACTGTGAAGTTGGCCTCATTGTTGGCACGGGCAG 

GCMCGTGGMCTGGTGGAAGGAGMGAGGGGCGGATGTGTGTGMCATGGMTGGGGGGCATTC 
45 ATGACTTCCGCACAGMTTTGATGTGGCTGTGGATGAGCTTO 

GMTGTACCTGGGTGAGATTGTCCGTMCATTCTCATCGATTTCACCMGCGTGGACTGCTCTTCCG 

GGCTCMGACMGGGGCATCTTTGAMCCMGTTCTTGTCTCAGATTGAGAGTGACTG 

TCCTGCMCACOTAGGGCTTGAGAGCACCTGTGACGACAC£ATCATTC 

CCCAGCTCTGTGGCGCAGGCATGGCCGCTGTGGTGGACAGGATACGAG 
50 GTGTGGATGGGACCCTCTACMGCTACATCCTCACTTTGCC 

ATGTGTCTTTCCTGCAGTCAGAGGATGGCAGCGGGMGGGGGCGGCGCTCATCACTC 
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GACAGCGATAGMCCCCTGAMTCGGMGGGACTTCCTCTTTCTCTCCTTC 
TQTGTCAGAGACAGACCCCTTGGCTTTTGCTTGGCAGAGAGGACCCCACTG 
CTTGGTGGCTGAGCTTGGCCCTATTMGATAMTAGAGTTCGAMTMGGATTO 
TTCTCCTAAMCATGAAAATTATCTCCCmGTMTCCCCCTTGCCAMTTCCATC 
5 ACTAATGMGATACGGTTGCTTCACCTTGGAGCCTGMCATGACATTTC 

CTGATTCTCCTGTCAGAGATCTGGGAGGTCTCCACTGAGGATGTGAGCCTGAmTC 

GGGTGACAGTGKSAGGAAMTCCATGGATATCCACGC^^ 

AGCATTTGCCATTCCTGGAATCAAGGMTC^ 

CAGCTGTCGCATCTCAGCGGGGCGCACATGTTATCCACMGCMTGGACCTTTGGGG 
1 0 MTTTTTCCTGCAAAAGTGGMTCACTGTATT 

CTTCATCTTGACATGTAATGMTGGTCAGTTGTACGTAATGTAm 

GTCAGMTTGGCCTCAGTGTAGTTAMGGGCAGMGGGGMGATACTGACTAGTCATAGAAATACCTCATTCG 
AGAAGGGAAGCCTCTTCAGGGTGAGTGAATGGCAAAGCGGTTGCTTCTCCG 

251 

15 MTSKLAVALLMFLISAALCEGAVLPRSAKELRCQCIKTYSKPFHPK^ 
VQRWEKFLKRAENS • 

252' 

AGCAGAGCACACMGCTTCTAGGACAAGAGCCAGGAAGAMCCACCGGMGGMCCATCTCACTGTGTGTAM 

AGCTGGCCGTGGCTCTCTTGGCAGCCTTCCTGATTTCTGCAGCTCTGTGTC 
20 TTAGATGTCAGTGCATAAAGACATACTCGAMCCTTTCCACCCCAMTTTATCAMGMCTGAGAGTC 

ACTGCGCCMCACAGAMTTATTGTAMGCTTTCTGATGGMGAGAGC 

TTGTGGAGMGTTTTTGMGAGGGCTGAGMTTCATAAAAAMTTCATTCTCTGTGG 

GAAACTTCMGCAAATCTACTTCMCACTTCATGTATTGTGTGGGT^ 

TTAMTmMTTTCAGTAMCMTGMTAGTTTTO 
25 TATTTGMTCTACAAAAMCMCAMTMTTTTTAMTATMGGATT^ 

MTTGAGCCMGGGCCAAGAGMTATCCGMCTTTMTTTCAGGMTTGMTGGGTTTGCTAGM 

CATAAAMTGATGGGACAATAAATTTTGCCATAMGTC 

CCTAGTCTGCTAGCCAGGATCCACMGTCCTTGTTCCACTGTGCC^ 

CACCATCTTACCTCACAGTGATGTTGTGAGGACATGTGGMGCACTTTMGTTTTTT^ 
30 TMCTTATTMCCTATTTATTATTTATGTAT 

ATTGATTGMTAGTTATAAAGATGTTATAGTAMTTTATTTTATTTTAGATATTAMTGATG ■ 

AGGGTTTTTAGATTAMCAMGAMCMTTGGGTAeCCAGTTAMTTTTCATTTCAGATAM 

AGTACATTATTGTTTATCTGAMGTTTTMTTGAACT 

GGTAGTGCTGTGTTGMTTACGGMTMTGAGTTAGAACTAT^^ 
3 5 CTGGTTGAAACTTGTTTATTATGTACAMTAGATTC 

TTTTMCTTTMGATGTTTTTATGTG 

CATTTAAAATATAATTTGTTGTCAAAGTAAAAAAAAAAAAAAA 

253 

MGKVKVGWGFGRIGRLVTRMFNSGKVDIVAINDPFI^^ 
40 IKWGDAGAEYWESTGVFTTMEKAGAHLQGGAKRVI I SAPSADAPMFVMGVNHEKYDNSLKIISNASCTTNCLAPLAKVIHDNF 
GIVEGLMTTVHAITATQKTVDGPSGKLWRDGRGALQNI I PASTGAAKAVGKVI PELNGKLTGMAFRVPTANVSWDLTCRLEKP 
AKYDDIKKVVKQASEGPLKGI LGYTEHQWSSDFNSDTHSSTFDAGAGI ALNDHFVKLI SWYDNEFGYSNRWDLMAHMASKE 

254 

CTCTCTGCTCCTCCTGTTCGACAGTCAGCCGCATCTTCTTTTGCGTCGCCAGCCGAGCCACATC 
45 GTGAAGGTCGGAGTCMCGGATTTGGTCGTATTGGGC^^ 

GCCATCMTGACCCCTTCATTGACCTCMCTACATGGTTTACATC 

GTCMGGCTGAGMCGGGAAGCTTGTCATCMTGGAMTCCCATCACCATCOT 

GGCGATGCTGGCGCTGAGTACGTCGTGGAGTCCACTGGCGTCTTCACCACCATGGAGM 
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GCCAAMGGGTCATCATCTCTGCCCCCTCTGCTGATGCCCCCATGTTCGTCATGGGTG 
CTCMGATCATCAGCMTGCCTCCTGCACCACCMCTGCTTAGCACCCCTGGCCMGGTCATCCAT^ 
GMGGACTCATGACCACAGTCCATGCCATCACTGCCACCCAGMGACTGTGGATGGCCCCTC 
CGCGGGGCTCTCCAGMCATCATCCCTGCCTCTACTGGCGCTGCCMGGCTGTGGGCMGGTC 
5 CTCACTGGCATGGCCTTCCGTGTCCCCACTGCCMCGTGTCAGTGGTGGACCTGACCTGCCGTCTA 

GATGACATCMGMGGTGGTGMGCAGGCGTCGGAGGGCCCCCTCMGGGCATCCTGGGCTACACTGAGC 

TCTGACTTCMCAGCGACACCCACTCCTCCACCTTTGACGCTGGGGCTGGCATTGCCCTCMCGACC 

TCCTGGTATGACMCGMTTTGGCTACAGCMCAGGGTGGTGGACCTCATGGCCCACATG^ 

ACCACCAGCCCCAGCAAGAGCACAAGAGGAAGAGAGAGACCCTCACTGCTGGGGAGTCCCTGCCACACTCAGTCCCCCACCACA 
10 CTGMTCTCCCCTCCTCACAGTTGCCATGTAGACCCCTTGMGAGGGGAGGGGCCM 
AATAAAGTACCCTGTGCTCAACC 

255 

MSLSNKLTLDKLDVKGKRVVMRVDFWPM™^ 
• LKSLLGKDVLFLKDCVGPEVEKACAN^ 
15 AHRAHSSMVGVNLPQKAGGFLMKKELNYFAKALESPERPFLAILGGAK^^ 
IGTSLFDEEGAKIVKDLMSKAEKNGVKITLPVD^^ 
NGPVGWEWEAFARGTKALMDEWKATSRGCITO 

256 

MGCCTCCGGAGCGCACGTCGGCAGTCGGCTCCCTCGTTGACCGAATCACCGACCTCOT 
20 GCTTTCTMCMGCTGACGCTGGACMGCTGGACGTTAMGGGM 

GMCAACCAGATMCAMCMCCAGAGGATTMGGCTGCTGTCCCMGCATC^ 

AGTCCTTATGAGCCACCTAGGCCGGCCTGATGGTGTGCCCATGCCTGACMGTACTCCTTAGAGCCAGTTGCTGTAGAACTCAA 
ATCTCTGCTGGGCMGGATGTTCTGTTCTTGMGGACTGTGTAGGCCCAGMGTC 
GTCTGTCATCCTGCTGGAGMCCTCCGCTTTCATGTGGAGGMGAAGGGAAGGGAAMGATGCTTC 
25 CGAGCCAGCCAAMTAGMGCTTTCCGAGCTTCACTTTCCMGCTAG^ 

CAGAGCCCACAGCTCCATGGTAGGAGTCMTCTGCCACAGMGGCTGGTGGGTTTTTGATGMGMGG 

AMGGCCTTGGAGAGCCCAGAGCGACCCTTCCTGGCCATCCTGG&^ 

TATGCTGGACAAAGTCMTGAGATGATTATTGGTGGTGGAATC^ 

CACTTCTCTGTTTGATGMGAGGGAGCCMGATTGTCAAAGACCTMTGTCCAMGCTGAGMGAATGG 
30 GCCTGTTGACTTTGTCACTGCTGACMGTTTGATGAGMT^ 

CTGGATGGGCTTGGACTGTGGTCCTGAMGCAGCMGMGTATGCTGAGGCTGTCACTCGGGCT 

TCCTGTGGGGGTATTTGMTGGGAAGCTTTTGCCCGGGGMCCAM 

CTGCATCACCATCATAGGTGGTGGAGACACTGCCACTTGCTGTGCCAMTGGMCACGGA 

TGGGGGTGGTGCCAGTTTGGAGCTCCTGGMGGTAMGTCCTTCC 
35 CTTTTAGTTCCTGTGCACAGCCCCTMGTCAACTOGCATTTTCTGCATCTCCAC™ 

GATTCAGCTAGTGGCCMGAGATGCAGTGCCAGGMCCCTTAMCAGTTGCACAGCATCTCAGCTCATC 

ATTTGCATACATTCTTCMGATCCCATTTGAATTT^ 

GCCTGTTAAAMGAMGTGAGCAGTGTTAGCTTAGTTCTCTTTTG 

CTCAGGATGGAMCAAGATGAMTTCCATTTCTAGGTAGTC 
40 TTG 

257 

MSILKIHAREIFDSRGNPTVEVDLFTSKGLFRMVPSGASTGIYEAL^ 
TEQEKIDKLMIEMDGTENKSKFGANAILGVSLAVCKAGAVEKGVPLYRHIADIAGNSm 

MILPVGAANFREAMRIGAEVYHNLKNVIKEKYGKDATNVGDEGGFAPN I LENKEGLELLKTAIGKAGYTDKWIGMDVAASEFF 
45 RSGKYDLDFKSPDDPSRYISPDQLADLYKSFIKDYPWSIEDPFDQDDWGAWQKFTASAGIQWGDDLTVTNPKR 
CNCLLLKVNQIGSWESLQACKLAQANGWGVMVSHRSGETEDTFIADLWGLCTGQI 
AKFAGRNFRNPLAK 



258 
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TAGCTAGGCAGGMGTCGGCGCGGGCGGCGCGGACAGTATCTGTCX3GTACCCGGAGCACGGAGATCTC(^ 
CCTCGGTGTCTGCAGCACCCTCCGCTTCCTCTCCTAGGCGACGAGACCCAGTGGCTAGMGTTC 
CCATGCCAGGGAGATCTTTGACTCTCGCGGGMTCCCACTGTTGAGGTTGATC 
TGTGCCCAGTGGTGCTTCMCTGGTATCTATGAGGCCCTAGA^^ 
5 CTCAMGGCTGTTGAGCACATCMTAAMCTATTGCGCCTGCCCTO 
TGACAMCTGATGATCGAGATGGATGGMCAG 
• * CT(X:AMGCTGGTGCCGTTGAGMGGGGGTCCCCCTGTACCGCCACATO 

AGTCCCGGCGTTCMTGTCATCMTGGCGGTTCTCATGCTGGCAACAAGCTGGCCATGCAGGAGTTCATGATCCTCCCAGTCGG 
TGCAGCAMCTTCAGGGMGCCATGCGCATTGGAGCAGAGGTTTAC 
10 AGATGCCACCAATGTGGC^GATGAAGGCGGGTTO 

TATTGGGAMGCTGGCTACACTGATAAGGTGGTCATCGGCATGGACGTAGCGGCCTCCGAGTO 

CCTGGACTTCAAGTCTCCCGATGACCCCAGCAGGTACATCTO; 

CTACCCAGTGGTGTCTAl'CGMGATCCCTTTGACCAGGATGACTGG 

GGTAGTGGGGGATGATCTCACAGTGACCMCCCAMGAGGATCGCCMGGCCGTGMCGAGMGTCCTGCMCTGCCTC 

15 CAAAGTCMCMGATTGGCTCCGT^^ 

TCATCGTTCGGGGGAGACTGMGATACCTTCATCGCTGACCTGGTTGTGGGGCTGTGCA 
TTGCCGATCTGAGCGCTTGGCCMGTACMCCAGCTCCTCAGMTTGMGAGGA 
GMCTTCAGAMCCCCTTGGCCMGTMGCTGTGGGCAGGCMGCCCTTCGGTCACCTGTTGGCTACAC 
TGTCAGCTCAGGCAGCTCGAGGCCCCCGACCMCACTTGCAGGGGTCCCTGCTAGTTAGCGCCCCACCGCCGTGGAGTTCGTAC 

20 CGCTTCCTTAGAACTTCTACAGMGCCMGCTCCCTGGAGCCCTGTTGGCAGCTOT 
TCATTGTTTTTCTCGCC^ 

TCCCCGGTGGTTTTGTGCTCAAAATAAAAAGCCTCAGTGACCCATGAG 
259 . 

MPHSYPALSAEQKKELSDIALRIVAPGKGILMDESVGSMKRLSQIGVENTEENRRLYRQVLFSADDRVKKCIGGVIFFHETL 
25 YQKDDNGVPFVRTIQDKGI WGI KVDKGWPLAGTM SERTPSALAI LENA 

NVLARYASICCX2NGIVPIVEPEILPDGDHDLKRCQYVTE 
WTALRRTVPPAVPGVTFLSGGQSEEEASFNLNAI^ 
LAAQGKYEGSGEDGGAAAQSLYIANHAY 

260 

30 CCGAGCTGTGCTTGTGGCTGCGGCTGCTMCTGGCTGCGCACAGGGAGCTG 

CTGAGCAGMGAAGGAGTTGTCTGACATTGCCCTGCGGATTGTAGCCCCGGGCAMGGCATTCTGGCTGCGG 

GCAGCATGGCCMGCGGCTGAGCCAMTTGGGGTGGAAMCACAGAGGAG 

CTGATGACCGTGTGAAAMGTGCATTGGAGGCGTCATTTT^ 

TCGTCCGMCCATCCAGGATMGGGCATCGTCGTGGGCATCMGGTTGACMGGGTGTGGTGCCTC 
35 AMCCACCACTCMGGGCTGGATGGGCTCTCAGAACGCTGTGCCCMTACAAGMGGATGGTGCTGACT^ 

GTGTGCTGAAMTCAGTGAGCGTACACCCTCTGCACTTGCCATOCT 

GCCAGCAGMTGGCATTGTGCCTATTGTGGMCCTGAMTATTGCCTGATGGAGACCACG 

CAGAGAAGGTCTTGGCTGCTGTGTACMGGCCCTGAGTGACCATCATGTATACCTGGAGGGGAC 

TGACCCCGGGCCATGCCTGTCCCATCMGTATACCCCAGAGGAGATTGCCATGGCMCTC^ 
40 CCCCAGCTGTCCCAGGAGTGACCTTCCTGTCTGGGGGTCAGAGCGAAGMGAGGCATCATTCMCCTCM 

GCCCCCTTCCCCGACCCTGGGCGCTTACCTTCTCCTATGGGCGTGCCCTGCMGCCTCTGCACTCMTGCCTC 

GGGACMTGCTGGGGCTGCCACTGAGGAGTTCATCMGCGGGCTGAGGTGMTGGGCTO 

GTGGAGMGATGGTGGAGCAGCAGCACAGTCACTCTACATTGCCAACCATGCCTACTG 

GGCCCAGCCATCTGCACCCACTTTTGCTTGTAGTCATGGCCAGGGCCAMTAGCTATX^ 
45 CCAACTTGTCTTCClTrcTCTCTTCCCTTCCCCTCTCTC 

ATGACTGAGGGCAGMGAAATTGCTAGMGTCAGMCAGGATGGCTGG^ 

CATGATGAGGTAGCTTCTCCCTGGGCTCTCCTTCTTGCCTGCCCnt5TCTC 

ATGCCTTGAGTACATACCATACAGCAAATAAATGGTAGCAAAACAAAAAAAAAAAAAAAAAAAAAAAAAA 



261 



WO 02/46465 



PCT/GB01/05458 



346 



MAPSRKFFVGGNWKMNGRKQSLGELIGTLNAAKVPAOT 
KDCGATW\AfijGHSERRHWGESDELIGQKVAHALAEGIjGVI^ 

WAIGTGKTATPQQAQEVHEKLRGWLKSNVSDAVAQSTRI I YGGSVTGATCKELASQPDVDGFLVGGASLKPEFVDI INAKQ 
262 

5 GCGCGACACTCACCTTCAGCGCCTCGGCTCGG 

GCGGMGCAGAGTCTGGGGGAGCTCATCGGCACTCTGMCGCGGCCMGGTGCCGGCCGACACCGAGGTGGTTTGTGCTCCC 
TACTGCCTATATCGACTTCGCCCGGCAGMGCTAGATCCCMGATTGCTGTGGCTGCGCAGAACTC 
GGCTTTTACTGGGGAGATCAGCCCTGGCATGATCAAAGACTGCGGAGCCACGTGGGTGGTC 
TGTCTTTGGGGAGTCAGATGAGCTGATTGGGCAGAM^ 

10 GGAGMGCTAGATGAAAGGGMGCTGGCATCACTGAGMGGTTGTTTTCGAGCAG 

CTGGAGCMGGTCGTCCTCGCCTATGAGCCTGTGTGGGCCATTGGTACTGGCMGACTGCMCACCCCMCAGGCCCAGGMGT 
ACACGAGMGCTCCGAGGATGGCTGAAGTCCAACGTCTCTGATGCGGTGGCTCAGAGCACCCGTATCA 
GACTGGGGCAACCTGCAAGGAGCTGGCCAGCCAGCCTGATGTGGATGGCT^ 
CGTGGACATCATCAATGCCAAACMTGAGCCCCATCCATCTTCCCTACCCTTCCTGCCMGCCA 

15 CCCAGTMCTGCCCTTTCCCTGCATATGCTTCTGATGGTGTCATCTGCTCCTTCCTGTG^ 
TACTGTTTATATCTTCACCCTGTMTGGTTGGGACCAGGCCAATCCCTTC 
CAAGGTGGCTTCTCCTTGGCTGAGAGATGGMGGCGTGGTGGGATTTGCTCCTGGGTTCCCT 
GAMCCATCCTCTCCCTTCTTACACCGTGAGGCCMGATCCCCTCAGMGGCAGGAGTGCTGCCCTCTCCCATG^ 
CTCTGTGCTGTGTATGTGAACCACCCATGTGAGGGAATAAACCTAGCACTAGG 

20 263 

MASKLLRAVILGPPGSGKGTVCQRIAQNFGLQHLSSGHFLRENIKASTEVGEMMQYIEKSLLVPDHVITRLMMSELENRRGQH 

WLLDGFPRTLGQAEALDKICEVDLVISLNIPFETLKDRLSRRW 

ARLRQYKDVAKPVIELYKSRGVLHQFSGTETNKIWPYVYTLFSNKITPIQSKEAY 

264 

25 CGGCGCTGGGCTGAGGGGAGGGGTTGTCTTAAMGTCTCTCCTTCCCCCTGTAGGGGCGG 

' GGGTGCCAGAGGTAGGGGGCCGAGAMCAMGTTCCCGGGGCTTCCTCCGGGGCCGCGGTCGGGGCTGCGCGTTTGACCGCCCC 

CCTCCTCGCGMGCMTGGCTTCCAMCTCCTGCGCGCGGTCATCCTCGGGCCGCCCGGCTCG^ 

AGGATCGCCCAGMCTTTGGTCTCCAGCATCTCTCGAGCGGCCACTTCTTGCGGGAGM 

GAGATGGCAMGCAGTATATAGAGAAAAGTCTTTTGGTTCCAGACCATC 
30 AGGCGTGGACAGCACTGGCTCCTTGATGGTTTTCCT 

CTAGTGATCAGTTTGAATATTCeATTTGAMCACTTAMGATCGTCTCA 

TATAACCTGGACTTCAATCCACCTCATGTACATGGTATTGATGACGTCACTGGTGMCCGTTAGTCCAGCAGGAGGATGATAAA 

CCCGMGCAGTTGCTGCCAGGCTMGACAGTACAMGACGTGGCAMGCCAGTC 

CACCMTTTTCCGGMCGGAGACGMCAAMTCTGGCCCTACGTTTACACACTTTTCTCAM 
35 AMGMGCATATTGACCCTGCCCAATGGMGMCCAGGAAGATGTGGTCATTCATTC 

TCCAMTTAGMGCTAGCTGAGGTAGCTTGCAGCATCTTTTCTAGTTGAMTC 

GAGTTCATGMGAGGCCCTCCTCTGCCTTTCAAMGGCTGGTCACCTACACATGTTTMGGTGTCTC 

CATCACMGAMGCMGTACAGTGTGGATTTCAMTGGTGTGTM 

MTATTTTTGACATGTGATGGTGATAGTCTCTGGTTCTCCCCATCCCCACAM 
40 GMTGMTCCTGAGGGCTCTAGCCCAGGCTTTGTCCCAGGCTTTC 

TTACTCATAGTGGTTGTTTCTCTGGTCTTGAGTGACTGTCT 

TCCACGCTCCATAGAGTCTCTCCTTTTCAGACATCCTGGGATGAMGM 

GTTTTCACTCTTAGGCTTTTAMCMTAGTTATTGCTTTTATC 
, AATCTCTGTATGCACTGAGAACTGAGCTATGMGAGMTCTTATTAAACTGCTGGTC 
45 TCTTTTATTGTGAAAAAAAAAAAAAAA 

265 

MDPNCSCAAGDSCTCAGSCKCKECKCTSCKKSCCSCCPVGCAKCAQGCICKGASDKCSCCA 
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266 

ATGGATCCCMCTGCTCCTGCGCCGCCGGTGACTCCTGCACCTGCGCCGGCTC^ 
TGCMGAAMGCTGCTGCTCCTGCTGCCCTGTGGGCTGTG^ 
TGCAGCTGCTGCGCCTGA • 

5 267 

MRIAVICFCLLGITCAIPVKQADSGSSEEKQLYMYPDAVATWL^ 

SQDSIDSNDSDDVDDTDDSHQSDESHHSDESDELVTDFPTDLPATEVFTPWPTVDTYDGRGDSVVYGLRSKSKKFRRPDIQYP 

DATDEDITSHMESEELNGAYKAIPVAQDLNAPSDWDSRGKDSYETSQLDDQSAETHSHKQSRLYKRKANDESNEHSDV 

SKVSREFHSHEFHSHEDMLWDPKSKEEDKHLKFRISHELDSASSEVN 

10 268 

GCAGAGCACAGCATCGTCGGGACCAGACTCGTCTCAGGCCAGTTGCAGCCTTCTCAGCCAMCGCCGACCMGGAA 
ACCATGAGMTTGCAGTGATTTGCTTTTGCCTCCTAGGCAT^ 

GAGGAAMGCAGCTTTACMCAMTACCCAGATGCTGTGGCCACATGGCTAMCCCTGACCCATCTCA 

GCCCCACAGACCCTTCCMGTMGTCC&ACGA 
15 GACAGCCAGGACTCCATTGACTCGMCGACTCTGATGATGTAGATGACACTGATGATTCTC 

TCTGATGMTCTGATGMCTGGTCACTGATTTTCCCACGGACCTGCCAGCAAC^ 

GACACATATGATGGCCGAGGTGATAGTGTGGTT^^ 

CCTGATGCTACAGACGAGGACATCACCTCACACATGGA^ 
• GACCTGMCGCGCCTTCTGATTGGGACAGCCGTGGGAAGGACAGTTATC 
20 CACAGCCACAAGCAGTCCAGATTATATMGCGGAAAGCCAATGATGAGAGCAATGAGCATTCCGATGTGATTC 

ctttccamgtcagccgtgaattccacagccatgmtttcac 
gmgataaacacctgamtttcgtatttctcatgmttagatagtgcatc 
ttctcactttgcatttagtcaamgaaaamtgctttatag 
gttgmtgtgtatctatttgagtctggamtmctmtgtgtttgatmttagt™ 
25 amcta^gcttcagggttatgtctatgttc^^ 

tcatgmtagamtttatgtagmgcamcaamtact^ 
ttttgttttttaagttagtgtatattttc 
tttggtggtgtcmttgcttatttgttttcccacggttgtccagcaact 
aaaaaaaaaaaa 

30 269 

MmVSWVALTAGLVAGTO^ 

PFPEAVACGDGHHCCPRGFHCSADGRSCFQRSGMSVGAIQCPDSQFECPDFSTCCVMVDGSWGCCPMPQASCCEDRVHCCPHG 
AFCDLVHTRC I TPTGTHPLAKKLPAQRTNRAVALS S SVMC PDARSRC PDGSTCCELPSGKYGCC PMPMATCC SDHLHCCPQDTV 
CDLI QSKC LSKENATTDLLTKLPAH WGDVKCDMEVSC^ 
35 QGPHQVPWMEKAPAHLSLPDPQALKRDVPCDNVSSCPSSDTC 

EIVAGLEKMPARRASLSHPRDIGCDQHTSCPVGQTCCPSIX^SWACCQLPHAVCCEDRQHCCPAGYTCNV^ 

ATFLARSPHVGVKDVECGEGHFCHDNQTCCRDNRQGWACCPYRQG^CCADRRHCCPAGFRCMRGTKCLRREAPRWDAPLRDPA 

LRQLL 

270 

40 • GTAGTCTGAGCGCTACCCGGTTGCTGCTGCCCMGGACCGCGGAGTCGGACGCAGGCAGAGCATC 
TGGCCTTMCAGCAGGGCTGGTGGCTGGAACGCGGTGCCCAGATGGTCAGTTCTGCCCTGTGGC 
GAGCCAGCTACAGCTGCTGCCGTCCCCTTCTGGACAMTGGCCCACMCACTGAGCAGGCATCl'GGGTGGC 
ATGCCCACTGCTCTGCCGGCCACTCCTGCATCTTTACCGT^ 

GCGGGGATGGCCATCACTGCTGCXCACGGGGCTTCCACTGCAGTGCAGACGGGCGATCCTGCTTC 
45 CCGTGGGTGCCATCCAGTGCCCTGATAGTCAGTTCGMTGCCCG 

GGTGCTGCCCCATGCCCCAGGCTTCCTGCTGTGMGACAGGGTGCACTGCTGTCCGCACGGTG 
CCCGCTGCATCACACCCACGGGCACCCACCCCCTGGCAMGMGCTCCCTGCCCAGAGGACTMCAGG 

gctcggtcatgtgtccggacgcacggtcccggtgccctgatggttctacctcx:tc 
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GCCCAATGCCCMCGCCACCTGCTGCTCCGATCACC 

GCCTCTCCMGGAGMCGCTACCACGGACCTCCTCACTMGCTC 

TGAGCTGCCCAGATGGCTATACCTGCTGCCGTCTA 

AGGACCACATACACTGCTGTCCCGCGGGGTTTACGTGTGACACGCAGMGGGTACCTGTGMCAGGGGCC 
5 GGATGGAGMGGCCCCAGCTCACCTCAGCCTGCCAGACCCACAAGCCTTGMGAGAGATGTCCCCTGTGATAATGTCAGCAGCT 
GTCCCTCCTCCGATACCTGCTGCCMCTCA^ 
ACCAGCACTGCTGCCCCCAGGGCTACACGTGTGTAGCTGAGC^ 

AGATGCCTGCCCGCCGGGCTTCCTTATCCCACCCCAGAGACATCGGCTGTGACCAGCACACCAGCTGCCCGGTGGGGCAGACCT 
GCTGCCCGAGCCTGGGTGGGAGCTGGGCCTGCTGCCAGTTGCCCCATGCTGTGTGCTGCG 
10 CTGGCTACACCTGCMCGTGAAGGCTCGATCCTGCGAGMGGMGTGGTC 

CTCACGTGGGTGTGMGGACGTGGAGTGTGGGGMGGACACTTCTGCCATGATMC 
GCTGGGCCTGCTGTCCCTACCGCCAGGGCGTCTGTO 

GGGGTACCMGTGTTTGCGCAGGGAGGCCCGGCGCTGGGACGCCCCTTTGAGGGACCCAGCCTTGAGACAGCTGCTGTGAGGGA 
CAGTACTGMGACTCTGCAGCCCTCGGGACCCCACTCGGAGGGTGCCCTCTGCTCAGGCCTCCCTAGCACCTCCCCCTMCCAA 
1 5 ATTCTCCCTGGACeCCATTCTGAGCTC^ ' 
' GTGGCAAMGCCACATTAGAAGCTGCCATCCCCTCrc 
TGTTTGTGTGTGTGCGC^^ 

271 

MKHVLNLYLLGVVLTLLSIFVRVMESLEGLLESPSPGTSWTTRSQLANTEPTKGLPDHPSRSM 
20 272 

GCACGAGGGCGCTTTTGTCTCCGKjTGAGTTTO 

CTGCACGACCTGCTCCTACAGCCGGCGATCCACTCCCGGCTGTTCCCCCGGAGGGTCCAGAGGCCTTTCAGAAGGAGAAGGCAG 
CTCTGTTTCTCTGCAGAGGAGTAGGGTCCTTTCAGCCATGAAGCATGTGTTGAACCTCTACCTGTTAGGTGTGGTACTGACCCT 
ACTCTCCATCTTCGTTAGAGTGATGGAGTCCCTAGMGGCTTACTAGAGAGCCCATCGCCTGGGACCTCCTGGACCACCAGMG 
25 CCMCTAGCCMCACAGAGCCCACCMGGGCCTTCCAGACCATCCATCCAGMGCATGTGATMGACCTCCTTCCATACTGGCC 
ATATTTTGGMCACTGACCTAGACATGTCCAGATGG 

TATTACTAGGCCTTGAAGMCCTGTCTMCTGGATGCTCATTGCCTGGGCMGGCCTGTTTAGGCCG 

CTGTMTCCTAGCACTTTGGGAGGCTGAGGTGGGTGG 

ACCCCATCTCTACTAAAMTACAAMGTTAGCTGG 
30 GAGMTTGCTTGMCCCGGGGACGGAGGTTGCAGTGMCCGAGATCGCACTGCTGTACCCAGCCTGGGCCACAGTC 

CATCTCAAAAAAAAAAAGAAMGAAAMGCCTGTTTMTGCACAGGTGTGAGTGGATTGCm 

TCGCCCTTACCCCGGGGTCTGGTGTATGCTGTGC 

GTTGGGAGATGGTGATATTTTCAACCCTACOT^ 

TTTGGTGTTGAGCCTCTCTTCCACMGAGCTCCTC 
35 GAGGATGGAGTGTTCAGTGCCCATTTCTCATTTTACATTTTAAAGTC 

GGTGGGATGCCAMGCCTGCTCMGTTATGGACATTGTGGCC^ 

TATATATTTCAAAAAAAAAAAAAAAAAA 

273 

MSIEKIWAREILDSRGNPTVEVDLYTAKGLFRAAVPSGASTGIYEALELRDGDKQRYLGKGVLKAVD 
40 VEQEKLDNLMLELDGTENKSKFGANAI LGVSLAVCKAGAAERELPLYRHI AQLAGNSDLI LPVPAFNVINGG SHAGNKLAMQEF 
MI LPVGAESFRDAMRLGAEVYHTLKGVIKDKYGKDATNVGDEGGFAPNI LENSEALELVKEAIDKAGYTEKI VIGMDVAASEFY 
RIXSKYDLDFKSPTDPSRYITGDQLGALYQDFVRDYPWS^^ 

CNCLLLKVNQIGSVTEAIQACKLAQENGVrcVMVSHRSGETEDTFIADLWGLCTGQIKTC^ 
ARFAGHNFRNPSVb 

45 274 

ACCCGCGCTCGTACGTGCGCCTCCGCCGGCAGCTCCTGACTCATCGGGGGCTC 

CGCCTCCTCCGCCCGCCGCCCGGGAGCCGCAGCCGCCGCCGCCACTGCCACTCCCGCTCT 

ACCGCCACCGCCACTACCACCGTCTGAGTCTGCAGTCCCGAGATCCCAGCCATCATGTCCATAGA^ 
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ATCCTGGACTCCCGCGGGAACCCCACAGTGGAGGTGGATCTCTATACTGCCAMGGTCTTT^ 
GCCTCTACGGGCATCTATGAGGCCCTGGAGCTGAGGGATGGAGACAMCAGCGTTACTTAGGCAM 
GACCACATCMCTCCACCATCGCGCCAGCCCTCATCAGCTCAGGTCTCTCTGTC^^ 
CTGGAGTTGGATGGGACTGAGMCAMTCCMGTTTGGGGCCMTGCCATCCTG 
5 GCAGCTGAGCGGGMCTGCCCCTGTATCGCCACATTGCTCAGCTGGCCGGGAACTCAGA 

mcgtgatcmtggtggctctcatgctggcmcmgctggccatgcaggagttca 

cgggatgccatgcgactaggtgcagaggtctaccatacactcmgggagtcatc 

gtgggggatgmggtggctttgcccccaatatcctggagmcagtc 

GGCTACACGGAAAAGATCGTTATTGGCATGGATGTTGCTGCCTCAGAGTTTTATCG^ 
10 TCTCCCACTGATCCWCCCGATACATCACTGGG 

TCCATTGAGGACCCATTTGACCAGGATGATTGGGCTCCCTGGTCCMGT^ 

GACCTGACAGTGACCMCCCAAMCGTATTGAGCGGGCAGTGGMGAAMGGCCTGCMCTGTCTC 

ATCGGCTCGGTCACTGAAGCCATCCAAGCGTGCAAGCTGGCCCAGGAGMTGGCTGGGGGGTC 

GAGACTGAGGACACATTCATTGCTGACCTGGTGGTGGGGCTGTGCACAGGC^ 
15 CGTCTGGCTAMTACMCCAGCTCATGAGMTTGAGGMGAGCTGGGGGATGMGCTC 

CCCAGTGTGCTGTGATTCCTCTGCTTGCCTGGAGACGTGGMCCTCTGTC 

ATAGTTCACCCCCTGAGATCCCCTGAGCCCCAGGGTGCCCAGMCTTCCCTGATTGACCTGCTC^ 

GACCTCTTGCTGTCTCTGCTCGCCCTCCTTTCTC^ 

CTCTTCCCTCAGAAACTAGAMTGTGMTGAGGATTATTAT 
20 CGTCAGGGTTGGTGTGCTGAGGTGTTAGAGAGGGACCATGTGTC^ 

ATATGAGCCGTGAACTGTGCATAGTGCTGGGATGGAGGGGAGTGTTGGGCATGTGATCACGC 

ATTTATTTATTTATTTATTTTATTTGTTTTTC 

CTTGGGGGMCGATGTGTCTGTATTTCATGTGGCTGTAGATCCCMGATGA 

TCATAGAMGGGCCTTGACATCAGTTCCTTTGTGTGTACTCACTGMGCCTGCGTTGGTCCAGAGCGG 
25 GGAGTTTTCCTCTATACATCTCTCCCCMCCCTAGGTTCCCTGTTCTTCCTCCAGCTGCACCAGAGCMCCTCTCACTC 
GCCACGTTCCACAGTTGCCACCACCTCTGTGGCATTGAMTGAGCACCTCCAT^ 

275 

mplnrtlsmsslpgledvffidefdlenavlfevavffivankvggiytvlqtkakvtgdew 
palkrtldsmnskgckvyfgrwlieggplvvlldvgasawalerwkgelwdicnigvpwydreandavlfgfl™ 
30 seekphwahfhewlagvglclcrarrlpvatiftthatl^ 

hwmsqitaieaqhllkrkpdivtpnglnvkkfsamhefqnlhaqsmiqe 

KGADWLEAIjARLNYLLRWGSEQTWAFFIMPARTNN^ 
pKEDFTMMKRAIFATQRQSFPPVCTHMLD^ 

FPSYYEPWGYTPAECTVMGIPSISmSGFGCFMEEHIADPSAYGIYILDRRFRSLDDSCSQLTSFLYSFCQQSRRQRIIQRN^ 
35 TERLSDLLDWKYLGRYYMSARHMALS 

ERYDEDEEAAKDRRNIRAPEWPRRASCTSSTSGRKRNSVDTATSSSLSTPSEPLSPTSSLGEERN 

276 

CGCTTCGGGCAGGGGTGCGGTCTTGCMTAGGMGC^ 

CCTACCTGGTGCATTCCCTAGACACCTCCGGGGTCCCTACCTGGAGATCCCCGGAGCCCCCCTTCCTGCGCCAGCCATGCCTTT 

40 aaaccgcactttgtccatgtcctcactgccaggactggaggactgggaggatgmttcgacctggagm 
agtgcx;ctgggaggtggctmcmggtg(x;tggcatctacacggtgctgca 
cgacmctacttcctggtggggccgtacacggagcagggcgtcaggacccaggtggmctgctggagg 

GMGAGGACACTGGATTCCATGMCAGCMGGGCTGCMGGTGTATTTCGGGCGCTGGCTGATCG 
GCTCCTGGACGTGGGTGCCTCAGCTTGGGCCCTGGAGCGCT^ 
45 GTACGACCGCGAGGCCMCGACG(^TCCTCTTTGGCTTTCTGACC^ 
GMGCCACATGTGGTTGCTCACTTCCATGAGTGGTTC^ 
MCCATCTTCACCACCCATGCCACGCTGCTGGGGCC£TACCTGTGTGCC 

CMCGTGGACMGGAAGCAGGGGAGAGGCAGATCTACCACCGATACTGCATGGAMGGGCGGCAGCCCACT 
CACTACTGTGTCCCAGATCACCGCCATCGAGGCACAGCACTTGCTCMGAGGAMCCAGATATTG 
50 TGTGMGAAGTTTTCTGCCATGCATGAGTTCCAGAACCTCCATGCT 
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tttttatgggcatctggacttcmcttggacaagaccttatact^ 
tgacgtctttctggaggcattggctcgc<:tcaactatctgctcagagtc 
catgccagcgcggaccmcmtttcmcgtggamccctcaaaggccmgctgtgcgcam 
ggtgmggaamgttcgggaggaagctttatgmtccttactggttgggagccttc 
5 agacttcactatgatgmgagagccatctttcx:aacgcagcggcagtctttcccccctc 
ctcctcagaccccatcctgaccaccatccgccgmtcggcctcttcmtagcagtgccgacaggg 
ggagttcctctcctccacmgccccctgctccctgtggactatgaggagtttgtcc 
ctactatgagccttggggctacacaccggctgagtgcacggttatgggaatccccagtatc 
ctgcttcatggaggmcacatcgcagacccctcagcttacggtatctacattcttgaccggc^ 
10 ctgctcgcagctcacctccttcctctac^^ 

cctctccgaccttctggactggaaatacctaggccggtactatatgtctc 

gcacttcacctacgagcccmcgaggcggatgcggcccaggggtaccgctacccacggccagcctcggtgccaccgtcgccctc 
gctgtcacgacactccagcccgcaccagagtgaggacgaggaggatccccggaacgggccgctggaggaagacggcgagcgcta 
cgatgaggacgaggaggccgccmggaccggcgcmcatccgtgcaccagagtggccgcgccg 

1 5 CAGCGGCCGCMGCGCMCTCTCTGGACACGGCCACCTCCAGCTCACTCAGCACCCCGAGCGAGCCCCTCAGCCCCACCAGCTC 
. CCTGGGCGAGGAGCGTAACTMGTCCGCCCCACCACACTCCCCGCCTGTCCTGCCTCTCTO 

GTGCTGCTCCTAMCCCCCGATCCAGATCTGCACGGGGTGCGGCCCCGCAGTGCCCCCACCCAGTCC 

CTCCAGCTCCAGTTTCCMGTTCCTGCACTCCTGMTCCACAMGCCGTGCCTTTCTCTGGCTCCAGAATA 

CCCTGGAGTCCCCTGGGCCTGGACCGCTTCCCAGAGGCCAGGAMTCTGCCATTACTCTGCGGTGG 

20 ACCTGGCATGGTGCTTTCAGGTCTGGGGCTTTTAGAGCCCCCCGTGTGGCTTACAMTTC 
TCAGGCCCGGCATGCGGGCCACCMGTTCTGGAMCCACGTGGTGTC 
TTTCAGAGTTTGMGGTMCTGAGAGCAGATGGTCCTCCATTTCAACTCCAGMGTGG 
CCCTGGCTGTCAGAGCCAGGCTCTGCCTGGAGGATCCCTCCATCCGGCTCCTGTCATCCCCTACACTTTGG 
GGTAGMCCACTTGGCTGCTCACTCCTTC^ 

25 ACCAGATCCTACTCTTCCTTTCTAMTCTGAGATCTO 

ACCCCTMCCTGGCTTATTCCCMCTGCTCTGCCCACTGTGAMCCACTAGGTTCTAGGTCCTGGCTTCTAGATCTGGM 
ACCACGTTACTGCATACTGATCCCmCCCATGATCCAGMCTGAGGTCACTGGGTTCTAGMC 
TCTTCCATCCCCAAACTGTGCCCTGCCTTCAGCTTTGGTGAAAGGGAGGGCCCCTCATGTGTGC 
TGGTTTGCAGTTGAGAGGGGAGGGCAGGAGWGGTGTGATTGGAGTC 

30 AGA 

277 

MESKGASSCRLLFCLLISATVFRPGLGWYTVNSA 

EYKDRLNLSENYTLSISNARISDEKRFVCMLVTEDNVFEAPTIWVFKQPSKPEIVSKALFLETEQLKKLGDGISEDS 
TWYRNGKVLH PLEGAWI I FKKE14DPVTQLYTMTSTLEYKTTKADIQMPFTC SVTYYGPSGQKTIH SEQAVFDI YYPTEQVTIQ 
35 ■ VLPPKNAIKEGDNITLKCLGNGNPPPEEFLFYLPGQPEGI^ . 
NPSGEWRQIGDALPVSCTISASRNATVVWMKDNIRLRSSPSFSSLHYQDAGNYVCETALQEVEGLKKRESLTLIVEGKP 
TKKTDPSGLSKTIICHVEGFPKPAIQWTITGSGSVIN^^ 

I PEHDEADEI SDENREKVNDQAKLI VGI WGLLLAALVAGVVYWLYMKSKTASKHWDIX3MEENKKLEENNHKTEA 
278 

40 CGGGACGACGCCCCCTCCTGCGGCGTGGACTCCGTCAGTGGCCCACCAAGAAGGAGGAGGAATATGGAATCCAAGGGGGCCAGT 

TCCTGCCGTCTGCTCTTCTGCCTCmATCTTC^ 

GGAGATACCATTATCATACCTTGCCGACTTGACGTACCTCAGMTCTCA 

TCCCCAGTATTTATTGCCTTCAGATCCTCTACAMGAAMGTGTGCAGTACGACGATC^ 

CTCTCAGAAMCTACACTTTGTCTATCAGTMTGCAAGGATCAGTGATC 
45 MCGTGTTTGAGGCACCTACMTAGTCMGGTGTTCMGCM 

ACAGAGCAGCTAAAAMGTTGGGTGACTGCATTTCAGMGACAGTTATCCAGATGGCAAT^ 

GTGCTACATCCCCTTGMGGAGCGGTGGTCATMTTTTTAAAMG 

ACCCTGGAGTACAAGACAACCMGGCTGACATACAMTGCCATTCACCTGCTCGGTGACAT^ 
ACAATTCATTCTGMCAGGCAGTATTTGATATTTACTATCCTACAGAGCAGGTGACM 
50 ATCAMGMGGGGATMCATCACTCTTAMTGCTTAGGGM 
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cagcccgaaggmtmgaagctcamtacttacacactgatggatgtgaggcgcmtgcmcaggagactacmgtgttccctg 
atagacaaaaamgcatgattgcttcmcagccatcacagt 
agacagattggtgatgccctacccgtgtcatgcacaatatc^ 
aggcttcgatctagcccgtcattttctagtc^ 
5 gmggactamgaamgagagtcattgactctcattgtagmggcamcctcamtaaamtgacamg 
ggactatctaamcmtaatctgccatgtggmggmtccamgccagccattcagtgga 
atamccamcagaggaatctccttatattmtggcaggt^ 
acttgcacagcagaamccmctggagagaacagtamctccttgmtgtctctgct^ 
gacgagataagtgatgaaaacagagaamggtgmtgaccaggcaaaactaattgtgggmtcgttg 
10 gcccttgttgctggtgtcgtctactggctgtacatg^ 

atggaagaamcaaamgttagmgaamcmtcacaamctgmgcctmg 

atcatatagaccmttgmgcatgmcgtggattgtatttmgacatamcamgaca™ 

tmgcagttcattctaccmgctgtcacaggtttra 

GMCAAGTTTTGGCAGCCATGATMTAGGTCATATC^ 
1 5 CTTTTTGGTTTGCCTTTTATGTAMTTTTTO 

GATGTATMTGTMTGTTTTTATTTCMTTGTTO 
TGCCCCAMCTTTCTCATMGCACCTAAAACCCAMGGTGGCAGC^ 

ACTGTGATTTTTATCACAAGGGAGGGGAGGCCGAGAGTCAGACTGATAGACACCATAGGAGCCGACTCTTTGATATC 
CGMCTCTCAGAMTAAATCACAGATGCATATAGACACACATACATMTGGTACTCCCAM^ 
20 AAAGACATAAAACAGAATT 

279 

MERVKmNVQRLLEAAEFLERRERECEHGYASSFPSMPSPRLQHSKPPRRLSRAQ 
LCLERLKVLIPI£PDCTRHTTIX3LLNK^ 

RSDSEREEIEVDVESTEFSHGEVDNISTTSISDIDDHSSLPSIGSDEGYSSASVKLSFTS 
25 280 

AGATTATGATCGCCTGAGGCCCCTCTCCTACCCAGATACCGATGT^ 

AGTAATTMGGGTAGTTAMTTATTTAMGTATACAMGTCCAMCAGCCAGGGGTMGGTCTC 
AGGGAGTGCGGA6A6GCCCCGGTCGCCACCCGCGGTGCCCATGGAGCGGGTGM 
TGCCGAGTTTTTGGAGCGCCGGGAGCGAGAGTGTGAACATGGCTAC^ 
30 GCATTCAAAGCCCCCACGGAGGTTGAGCCGGGCACAGAAACACAGCAGCGGGACGAGCAACACCAGCACTGCCAACAGATCTAC 
ACACAATGAGCTGGAAAAGMTCGACGAGCTCATCTGCGCCTTTGTm 

CTGCACCCGGCACACMCACTTGGTTTGCTGMCAMGCCAMGCACACATCMGAAACTTGMGMGCTG 
GCACCAGCTCGAGMTTTGGMCGAGMCAGAGATTTTTAMGTGGCGACTGGMCAGCTGCAGGGTC 
MTACGMTGGACAGCATTGGATCMCTATTTCTTCAGATCGTTCTC 
35 CACAGAGTTCTCCCATGGAGMGTGGACMTATMGTACCACCAGCA 

TGGGAGTGACGAGGGTTACTCCAGTGCCAGTGTCAAACTTTCATTCACTTCATAGAACCCAGCA 
AAATATTCACTGGGCCAATTCMTACAMCAATC^ 

AMCAAMCTATACTTGAACAAMGGGTCAGAGGACCTGTATTTMGCAMTACTTAGCAAAAAGTGGG 
AGAACAAATATTCAGAATATTCATATTC^^ 
40 AGTTGATTGAGMGAGGACATTGGAGATGCCATCCTCTTTCTCTTTTCTC 

GGATGGGGTTATGMCCCTTCCTGAGCTTTATGGTCCTAAMGCAAMTAAAMCTATTCGM 

TATTMTCTTGGATAGCTMTMTGAGCTATTAAMCTC 

ATTATTATTTTTTTTTTTTGAAAA 
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CAGCACTTGTCTCATTTTAATGTAMG^ 
GTGCTATTGTTACTCTTACGMTATTTTCAGTMTC 
TTCCCTTTGTCTTTCAMCTCCMGGTTCCCCTGTGGCCCTCTCCCTT 
TGTTTGGACTTTGTATACTTTAMTMTTT^ 

5 282 

ATCGCAACCTGGGCCTCAGGATGCTGCTGCAGTGATGGCGCCAACCCGTGGCAGTCCCAGAGCTGGAGGCAGGAGGATGG 
TCATCTCCATGGGMGTGTCAGCGTGTGGCTGCCAGGGMGCGTGGCAGGCGCCTGGCCT^ 
TTTCMCAATGTCCATTTATCCTTCACCCCGAGGCGT^ 
CCTTCCCACCTGTGCCCGTCCCTTCCTGMGTOT 
10 CCCTGGTCACCCATCTGCCCCTCACCTCGTCATCCAGGGACCCAGACCCTGCACCTTCCATGTGGGC 
GTACCTGAGGTGCACCATTGAGTGTCGGATTTGGGGTTAGCATCCAGAMGMG 
CTCAGGTCTTCAGGGAGAGAAAGGAAGACTGGATTGCACCTTGAT^ 

GGCCCGGCCCAGCCTTATCCMGTCGCTCTGTCCACCTCCCCCTTCCTGGCCCCCACCCCACTCCTGTC^ 
CCCTGTGCTCCACCTGCCTCCGCAGMGGMGCCTCTTTCTCTGTTTCCCTGG 
15 TAGCATCTCCAGGCCCTGCCATGGTGTCTCATCTTGCTGTTATCTCTAGC 

TTTTGCAAAGCTTGTAGCAGTAGCTCAGTTGCCTGCAGCATCCTTGTGTGTAGATAMTTAG 
GACAGGATTGCAMGTCGGGGACATAGATGCAGACAGTTGTTGAGATTTGGGGATAGCCGGGCTT^ 
AGATGAAGCCTTTCAGCCOTTCTGAGTCCCCGGCrc 
AATGCTGAATAAAGTGGGTTTGTGTCAAAAAAAAAAAAAAAAAAAAAAAAAA 

20 283 

MDQKSLWAGVWLLLLQGGSAYKLVCYFTMSQ^^ 

TKNPKLKILLSIGGYLFGSKGFHPMVDSSTSRLEFINSIILFLRNHNFDGLDVSWIYPDQKENTHFmiHELAEAF 
TKERLLLTAGVSAGRQMIDNSYQVEKLAKDLDFINLL^ 

PSEKVVMGIPTYGHSFTMSAETWGAPASGPGMGPITESSGFMYYEICQFLKGAKITRLQDQQVPYAWGNQWGYDDVKS 
25 METKVQFLKNLNLGGAMIWSIDMDDFTGKSCNQGPYPLVQAVKRSLGSL 

284 

agmgmgctggccmggatatgggagcmccaccatggaccagmgtctctctgggcaggtgtagtggtc 
agggaggatctgcctacaaactggtttgctactttaccmctggtcccaggac^ 
atattgaccccttcctatgctctcatctcatctattcattcgccagcatcgaam 
30 mgtgatgctctaccagaccatcmcagtctcaamccmgm 
ttggttccaaagggttccaccctatggtggattc 

accatmctttgatggactggatgtaagctggatctacccagatcagamgaamcactca 
tagcagmgcctttcagmggacttcacaaaatccaccmggamggcttct^^ 
tgattgatmcagctatcaagttgagamctggcaamgatctggatttc 
35 gggaamgccccttatcactggccacaacagccctctgagcmggggtggcaggacagagggccaagctcctactacm 
mtatgctgtggggtactggatacataagggmtgccatcagagmggtggtcatgggcatc 
cactggcctctgcagamccaccgtgggggcccctgcctctc^ 

tggcctattatgagatctgccagttcctgamggagccmgatcacgcgcctccaggatcagcaggttccc 
ggaaccagtgggtgggctatgatgatgtgaagagtatc^ 
40 tgatctggtctattgacatggatgactra^ 

gccttggctccttgtgmggattmcmcagagmgcaggcmgatgaccttgct^ 

ctcatgtgggattccccttgccaggctggcctttggatctctc^ 

cctggttttgttrtcctgcagctg 

285 

45 MSLSENSVFAYESSVHSTNVLLSLNDQRK 

EPLIQFAYTAKLILSKENVDEVCKCVEFLSVHNIEESC 

EVEEFLENKNVQTPQCKLRRYQGNAKASPPLQDSASQTYESMCLEKDAALALPSLCPKYR^ 
VQPNERSENECLGGVPECRDLQVMLKCDESKLAMEPEET 
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FAGMQNTTVLTEKPLSGTDVQEKTFGESQDLPLKSDLG^ 

MDGSEQISQKRSECPWLGIRISESPEPGQRTFTTLSSVNCPFISTLSTEGCSSNLEIGNDDYVSEPQQEPCPYACVISLGDDS 
ETDTEGDSESCSAREQECEVKLPFNAQRI I SLSRNDFQSLLKMHKLTPEQLDCIHDIRRRSKNRI AAQRCRKRKLDC IQNLESE 
IEKLQSEKESLLKERDHILSTLGm^ 
5 SDRPPAVLPPCARGNSEPGYARGQESQQMSTATSEQAGPAEQCRQSGGISDFCQQMTDKCTTDE 

286 

TCGCCCCCGCCGGGCGCTCTCGCTTCAGTCAGTCGGGCCGCGCCGCGCCTCAGCTC 

CCACTGMCTTCCCGACMCATTTGTTATGCAGAATG^ 

CACCMTGTTTTACTCAGCCTTMTGACCAG^ 
10 CCGCGCTCACCGGTCCGTGCTGGCGGCATGCAGCAGTTACTTCCACTC 

TACTCTTCCAGMGAGGTGACAGTTAMGGATTTGMCCTTTMTTCAGTTTC 

GAATGTGGATGMGTGTGCAMTGTGTGGAGTTm 

GTTTTTGGACTCCACTGCAGACCAGCMGAATGCCCMGAAAAAMTGCTTT 

TTCACTTTTGGACCAGAGGGATCTAGAMCTGATC^ 
15 ACTCCC£AGGTATCMGGAMTGCAAMGCCTCACCTCCTC 

GAAGGATGCTGCTCTGGCCTTGCCTTCTTTAT 

TGGGGAATCTAGTGTCAAAGACATTCATGCTO 

GTGTAGAGATTTGCAGGTGATGTTAAMTGTGACGAMGTAMTTAGCMTGGMCCTGMGAMCGM 
TCAGTGCCCMCTGAAAMTCAGMGTGACTCCTTTO 
20 TTTACACACATATGACCMTATGGTGACTTGMTTTTGCTGGTATGCAAAACA^ 

TACAGACGTCCMGAAAAMCATTTGGTGAAAGTCAGGATTTACCTTTGAMTCCGACTTGGGCACCAG 

TGCATCTAGTGATAGGAGTAGTGTGGAGCGAGMGTGGCAGMCACCTAGCAAMGGCTTCTGGAGTGACATTTC 

CACTCCTTGCCAAATGCAGTTATCACCTGCTGTGGCCAMGATGGCTCAGMCA 

GTTAGGTATCAGGATTAGTGAGAGCCCAGMCCAGGTCAMGGACTTTCACMCATTMGTTCTGTCMCTGCCCTTTTATAAG 
25 TACTCTGAGTACTGMGGCTGTTCMGCMTTTGGAMTTGGAMCGATGATTATGT^ 
ATATGCTTGTGTCATTAGCTTGGGAGACGACTCTC^^ 
ATGTGAGGTAAMCTGCCATTCMTGCACM^ 

GCTTACTCCAGMCAGCTGGATTGTATCCATGATATTCGMGMGMGTAAAMCAGMTTGCTGCACAG 

AAMCTTGACTGTATACAGAATCTTGMTCAGAMTTGAGA 
30 CATTTTGTCMCTCTGGGCGAGACAMGCAGMCCTMCTGGACTT^ 

ACAAATACAGATACTCGCCAAGTACTCAGCTGCAGAmCCCACTTTCATTTTTMTT^ 

TGGTGAACTGGCGTTACCATCAATTTTCAGTTTATCTGACCGGCCTCCAGCAGTGCTGCCTC 

GCCTGGCTACGCGCGAGGGCAGGAGTCCCAGCAGATGTCCACAGCCACCTCTO 

GAGTGGTGGGATCTCAGATTTCTGTCAGCAGATGACTGATAMTGTACTACTGATC^ 
35 CATCTMTTTTCTCCTGMGTTTTGGCAGCGTCTTGAMGCCTMTATGACCA 

TAGTAGTTTACCACAAGGGAATTTCCTOT^ 
■ TCCTGGATATCAGAAAAATCCATGTGAAMTGTAGTAMCCTTTAAMCTCATGTTTTAMG 

CTTCCTGCTATTCAGMTAAGTAGGAGMTGAAAAC 

TTCCTACTAAMGMGTGGCCTGCAGAAGTTTMTMTTTGACTTTTTTCTAATATTTT 

40 gcmtgctmtagagttctattcttagmgcagg 

ggtcttacctgmtcttagggcttoto^ 

agtggttggcctmgacgggggctgcttctcctcttcagtatggactctagam 

ctttccctccctgccccccgcttcagtctctacgatatctggtcccatcatc 

ggaagmgamtaggactcagaatacagtggcatgagtcattacactggcagca™ 
45 ttaccmcatgtamgctgttcatttttccaccgtgggtcaccaatgccagaamccagacatc^ 

actttttaccaggagtgcagttcatttttttc 

aaaaatgtatctgcagcmctctgcaggtttgtgamtaggatgamctc 

caggttgmtccttgctctcttctccamtttc^tgtg 

tcttagtgtttgttcagtcctgttaca 
50 gggttagatgtgtttgagagtcamtgamgctatggatc 

cattggcagmcccagttttmtggtacagaggagtagtttatagtgto 
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TCTATAGACTGCATCCTGAGCCTAGTGCAGGGCTTGTCTAGCTMTGTC 
AGCMGTTGGCCTTGCCCTTGAGAGTATATGGGGACCAGTCTTCATGTCTTGG 
AGGGTGTAGGGGGAGGATATTGCTAGTATATTTTCAGTGGTTTGTATGTTCTC 
AGTTGGACTTGTTTATTTTCTAGTAGCTmATMGTACACTC 
5 ACTGCAAMGMTTTGTGTGTGTGTGTGTGTGTGTGTGTGTGTA 
TTTGATACTTGTMTTTTAMTTTCAGC'TTCACGATATAAMTM 

cmtggmcccttgatttcctacctcagtgtacactcmc^ 

tttgctttaattgctactgtacttgctttgamgattaccta^ 

mtttgtamgcacttggtttamtttctctctacctatamcagtttagcattm 
10 ggccaagtgtaatttcttaaaatttagca™ 

gattttcaagttgtmtgcagatggacagataaaamgattttacgtttgtcttt 
- ttgcatagcattacacatttatgcctattttmc^ 

tttttgaggctgccgaagacaaatgacaggattatc^ 

cagtttgtatattgcatattcacatgatcattgctcactattttatgmctg 
1 5 tgttatgttgaattcagtaamtmcattaccttattttt 

tgtcatattcaaaacaaatgtggatacagtcttggttctccatc 

ctmttttaaataaggaaatmgtatgttagatgcagtagacgatacaggttgcatc 

ggaaaaaaatggcmtgttacggtgaattctcaggtgmctttot 

amtgttttattmggcatgtaatamctattctttc 
20 gcatactgattataaaagmgcagaaamcattgattttt^ 

ctaaacttttgtgtc 

287 

MVK IVTVKTQAYQDQKPGTSGLRKRVKVFQ S SANYAENFIQSI I STVE PAQRQEATLVVGGDGRFYMKEAI QL I ARI AAANGIG 
RLVIGQNGILSTPAVSCIIRKIKAIGGIILTASHNPGGPNGDFGIKF^ISNGGPAPEAITDKIFQISKTIEEYAVCPDLKVDLG 
25 VLGKQQFDLENKFKPFTVEIVDSVEAYATMLRSIFDFSALKELLSGPM 

PLEDFGGHHPDPNLTYAADLVEmSGEHDFGAAFDGDGDRNMILGKHGFFVNPSDSVAVIAANIFSIPYFQQTGVRGFARSMP 
TSGALDRVASATKIALYETPTGTOFFGNLMDASKLSLCGEESFGTGSDH^ 
GRNFFTRYDYEEVEAEGANKMMKDLEALMFDRSFVGKQFSAND^ 
SGTGSAGATIRLYIDSYEKDVAKINQDPQVMLAPLISIALKVSQLQERTGRTAPTVIT 

30 288 

gggccggccgcccctccgccagccaagtccgccgctctgacccccggcagcaagtcgccaccatggtgaagatcgtgacagtta 
agacccaggcgtaccaggaccagaagccgggcacgagcgggctgcggaagcgggtgaaggtgttccagagcagcgccaactacg 
cggagmcttcatccagagtatcatctccaccgtggagccggcgcagcggcaggaggccacg 
ggttctacatgmggaggccatccagctcatcgctcgcatcgct^ 
35 gmtcctctccacccctgctgtatcctgcatcattagaaamtcamgccattggtg 
cagggggccccaatggagattttggmtcaaattcm^ 

tccaaatcagcaagacmtomgmtatgcagtttgccctgacctgamgtagaccttggtc 

acttggaamtmgttcamcccttcacagtggamttgtggattc 

atttcagtgcactgamgmctactttctgggccamccgactgmgatctgtattgatgctatgca 
40 atgtamgmgatcctctgtgmgmctcggtgcccctg^ 

accaccctgaccccmcctcacctatgcagctgacctcgtggagaccatgm 

atggagatggggatcgamcatgattctgggcaagcatgggttctt^ 

acatcttcagcattccgtatttccagcagactggggtccgcggctttgcacggagca 

tggctagtgctacaaagattgctttgtatgagaccc^ 
45 ccctttgtggggaggagagcttcgggaccggttctgaccacatc 

ccatcctagccacccgcaagcagagtgtggaggacattctcamgatc 

atgattacgaggaggtggmgctgagggcgcaaacaamtgatgmggacttggaggccctgatgt^ 

ggmgcagttctcagcamtgacamgtttacactgtggagmggccgataactttgm 

tttcmgamtcagggcttgcgcctcattttcacagatggttctcgm 

50 CCACCATTCQGCTGTACATCGATAGCTATGAGAAGGACGTTGCCMG 



WO 02/46465 



PCT/G BO 1/05458 



355 

TTTCCATTGCTCTGAMGTGTCCCAGCTGCAGGAGAGGACGGGACGCACTG 

ATGTGGTACGTCCCTCCACCCCCGGACCCATCC 

TTTAGGATTTGACTTTTTCACTMCCAGTTC^ 

AGGGAMGAGGACCTGCGGGCTTAGATCMTCTCMTTCCTTTTCATGCC 
5 TTAGCCATCAGGTACAGTTTACACTACMTGTMGCTATAGGTGGAGCATC 
TGTGGCMTGAMTGATGGTGCMGTTCCTTTCTCTTTTGTGM 
TGCMTTTCTAGTGCCTTCTGTCCAATCAGTTCTTTCCTCTGAGTGAGA 
CATTGTCCCATTCACACAGATATTTTGGGATAATAAAGGAAAATAAGCTACA 

289 

10 MGSTVPRSASVLLLLLLLRRAEQPCGAELTFELPDNMQCF 

YDSFTFTASKNGTYKFCFSNEFSTFTHKTVYFDFQVGETHLCFLVDRVSALTQMESACVSIHEALKSVIDYQTHFRLREAQGRS 
RAEDIOTRVAYWSVGEALILLWSIGQVFLLKSFFSDKRTTTTRVGS. 

290 

GCACTGAGCCGCCGCeGCTTCCGGAAGCGCAGACCCGCTGGTGCCACGTTTATCCCCTTACATCCTCCTAGGACCCGG 
15 GTCGTCGCCCCAGCCCGCCGGGGGCMGCGCCCAGGTCCGCGGCCCTCGAGACGGGACCGAGAGCATCATGGGCAGCACTGTCC 

CGCGCTGCGCCTCCGTGCTGCTTCTGCTGCTGCTGCTGCGCCGGGCCGAGCAGCCCTGCGGGGCCGA 

CGGACMCGCCMGCAGTGCTTCTACGAGGACATCGCTCAGGGCACCMGTCGACCCTGGAGTTCCAGGTGATTACTGG 

ACTATGATGTAGATTGTCGATTAGAAGATCCTGATG6TMAGTC 

TCACAGCCTCCAAAAATGGGACATACAMTTTTGCTTCAGCMTGM 
20 TTCMGTTGGAGAGACCCACCTCTGTTTCCTAGTAGACCGAGTCAGTGCTCTTACCCAGATGGMTCTGCCTC 

ACGMGCTCTGMGTCTGTCATCGATTATCAGACTCATTTCCGTTTMGAGMGCTC 

ATACMGAGTGGGCTATTGGTCAGTAGGAGMGCCCTCATTCTTCTGGTGGTCAGCATAGGGCAGG 

TTTTCTCAGATAAAAGAACCACCACMCTCGTGT^ 
• TGCTGTCCTCTMTTMTTTTAGGTACyrGAAGAACTTMTATTGGC 
25 GATGTACMTGCATATTCCCAMCTGTGGAMC^ACACCTTTTTTTATTTGTAM 

CTATTCATGTTAAATATTCMCACC^ 

TTTGATTCCTTTTMCCATTTMGACTACTTTTCTTATAGGTAGTTGATATOT 
TAGGGAGGMGAAMTTGCCTTTTMTTGTTMTM^ 
CAGTTTTATGGAGGAAAMTAAATCTACCATAGGAATGTTAGTTMT^ 
30 GCMTTTTATTCTTTCGMGATTTCATTTAMTCTCTGTACTATGCATO 
TAMTAGGACAMGCAGGCCCTATGACTATTGTTTO 

GAAMGAMTTTCTTTMCAGGATTGCACCTATCMTTTCTTGTAGMTAAAAAAAMTC 
291 

MNGFTPDEMSRGGDAAMVAAWAAAAAAASAGNGTGAGTGAEVPGAGAVSAAGPPGMGPG^ 
35 FSKRIQKSISQKKVKIELDKSARHLYICDYHKNLIQSVRNRRK^ 

TRPGLNKAQLVEIVGCHFRSIPWEKDTLTYFIYSVKNDKNKSDLKVDSGVH 

'292 

cccccatgtgacagtgacggggtccccgctccaggagacgctcgagtctgcgtcccggccctcagcactgtccac 

gccagcagagaccagcaggcccgggacagttggtgtttggccgtgccg 
40 cggagcggagagaggcggagcggccaggagagaggggatttctgtcagcgccggcctcgggagctcggagacatgmc 

acgcctgacgagatgagccgcggcggggatgcggccgccgcagtggccgcagtggtcgctgccgcggcggccgccgcctcggcg 

gggmcx3ggacxggcgcgggcaccggggctgaggtgccgggcgcgggggcggtctc 

ccgggccccgggcmctgtgctgcctgcgggaggatggtgagcggtgcggccgggcggcaggcm 

atccagmgagcatctcccagmgaaggtgmgatcgagctggatmgagcgcmggcatctttac 
45 mcttmttcagagtgttcgamcagaagamgagaamgggagtgatgatgatc 

accccagaggttgatttataccaattacmgtamtacacttaggagatacaamgacacttcaagctaccaaccagaccagga 

cttmtamgcacmcttgttgagatagttggttgc 

atctactcagtgmgmtgacmgmcaaatcagatctcmggt^^ 
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TMCTTGGATGTTMCACTGTTTACTGTTTTTTCACATC 

TTTTATTTTCTTTGTTTCTGACTCTMT 

AAGGTTATTACTATTAAAAAAAAAAAAAAAAAA 

293 

5 mpsagtlpwqgiicnannpcfryptpgeapgwgnfmsiv^ 

qdflvdnetfsgflyhnlslpkstvdkmlradviiihkvflqgyqlhl^ 

lrsnmdilkpilrtlnstspfpskeiaeatkn^^ 

eggglkikslnwyednnykalfggngteedaetfydnsttpycndlmknl 

aevnktfqeiavfhdlegmvffielspkiwtfmen^^ 
10 vytwreafnetnqairtisrfmecvnlmlepiatevwlinksmellderkfwagivf 

rtnk i kdgywdpgpradpfedmryvwggfaylqdweqai irvltgtekktgvymqqmpypcyvddi flrvmsrsmplfmtlaw 

iysvaviikgiwekearuetmrimgldnsilwf^ 

cflistlfsranlaaacggiiyftlylpyvlcvawqdyvgftlkifasllspvafgfgceyfalfeeqg 
ixsfnlttsvsmmlfdtflygvmtwyi 
15 iqnlvkvyrdgmkvavdglalnfy 

fdmlweehiwfyarlkglsekhvkaemeqmaldvglp 
rgiwelllkyrqgrtiilsthhmdeadv^ 

kkedsvsqsssdaglgsdhesdtltidvsaisnlirkhvsearlvedigheltyvlpyeaakegafvelfheiddrlsdlgiss 
■ ygisettleeiflkvaeesgvdaetsexstlparrinirafgdkqsclrpfteddmdpndsdidpesretdllsgmdgkgs 
20 gwkltqqqfvallwkrlliarrsrkgffaqivlpavfvc^ 
lmaltkdpgfgtrcmegnpipdtpcqageeevot^ 
rkqntadilqdltgrnisdylvkt^^ 
flnslgrfmtgldtrnnwvwfnmg^ 

VSICVIFAMSFVPASFVmiQERVSKAmQFISGVKmYWLS^ 
25 LLLLYC^SITPLMYPASFWKIPSTAmLTSmFIGIN^ 

NQAMADALERFGENRFVSPLSWDLVGRNLFAMAVEGVVFFLITVLIQYRFFIRPRPVNAKLS 

NDILEIKELTKIYRRKRKPAVDRIWGIPPGECFGLLGWGAGKSSTF 

QFDAITELLTGREHVEFFALLRGVPEKEVGKVGEMAIRK^ 

. PKARRFLMCALSVVKEGRSWLTSHSMEECEALCTRMAIMVNGRFRCLGSVQHLKNRFGDGYTIVVRIAGSNPDLKPVQDFF 
30 UVFPGSVPKEKHRIMLQYQLPSSLSSLARIFSILSQSKKRLHIEDYSVSQTTLDQWWFAKI^ 
AVLTSFLQDEKVKESYV 

294 

CAAACATGTCAGCTGTTACTGGMGTGGCCTGGCCTCTATTTATCTTCCTGATCCTC 
TATGMCMCATGAATGCCATTTTCCAAATAMGCCATGCCC^^ 
35 GCCMCMCCCCTGTTTCCGTTACCC^ 

CTGTTCTCAGATGCTCGGAGGCTTCTTTTATACAGCCAGAMGACACCAGCATGAAGGACATGC 

CAGCAGATCMGAMTCCAGCTCAMCTTGMGCTTCMGATTO 

MCCTCTCTCTCCCAMGTCTACTGTGGACMG 

TTACATTTGACMGTCTGTGCMTGGATCAAMTCAGMGAGATGATTCAACTTGGTGACCM 
40 CTACCMGGGAGAAACTGGCTGCAGCAGAGCGAGTACTTCGTTCCMCATGGACATCCTGMGCC^ 

TCTACATCTCCCTTCCCGAGCMGGAGCTGGCCGAAGCCACAAAMCATTGCTGCATAGTCTTGGGA 

TTCAGCATGAGMGCTGGAGTGACATGCGACAGGAGGTGATGTTTCTGACCMTGTGAACAGCTC 

TACCAGGCTGTGTCTCGTATTGTCTGCGGGCATCCCG^ 

MCTACAMGCCCTCTTTGGAGGCMTGGCACTGAG 
45 GATTTGATGMGMTTTGGAGTCTAGTCCTCTTTCCCGCATTATCTGGAM 

TATACACCTGACACTCCAGCCACMC&CAGGTC^ 

GMGGCATGTGGGAGGAACTCAGCCCCMGATCTGGACCTO 

GACAGCAGGGACMTGACCACTTTTGGGMCAGCAGTTGGATGGCTTAGATTGGACAGCCCMG 
MGCACCCAGAGGATGTCCAGTCCAGTMTGGTTCTGTGTACACCTGGAGAGMGCTTTCAACGAGACTMCCA 
50 ACCATATCTCGCTTCATGGAGTGTGTCMCCTGMCMGCTAGMCCCATAGCMCAGMGTCTGGCTCATCM 
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GAGCTGCTGGATGAGAGGMGTTCTGGGCTGGTATTGTGTO 
. MGTACMGATCCGAATGGACATTGACMTGTGGAGAGGACAMTAAMTCMGGATGGGTACTGGGACCCTGG 
GACCCCTTTGAGGACATGCGGTACGTCTGGGGGGGCTTCGCCTACTTGCAGGA 
ACGGGCACCGAGMGAAMCTGGTGTCTATATGCMCAGATGCCCTATCCCTGTTACGTTGATC 
5 AGCCGGTCMTGCCCCTCTTCATGACGCTGGC^ 

GCACGGCTGAMGAGACCATGCGGATCATGGGCCTGGACMCAGCATCCTCTGGTTTAGCTGGTTCATOGTAG 

cttcttgtgagcgctggcctgctagtggtcatcctgmgttagga^ 
ttcctgtccgtgtttgctgtggtgacaatcctgcagtgcttcctga™ 

TGTGGGGGCATCATCTACTTCACGCTGTACCTGCCCTACGTCCTGTC 
10 ATCTTCGCTAGCCTGCTGTCTCCTGTGGCTTTTGG^ 
CAGTGOSACAACCTGTTTGAGAGTCCTGTC^^ 

TTCCTCTATGGGGTGATGACCTGGTACATTGAGGCTGTCTTTCCAGGCCAGTACGGM 

ACCMGTCCTACTGGTTTGGCGAGGAMGTGATGAGMGAGCCACCCTGGTTCCAACCAGMGAG 

GAGGAGGMCCCACCCACTTGMGCTGGGCGTGTCCATTCAGMCCTGGTAAAAGTCTACCGAGATGGGATGMGGTGGCTC 

15 GATGGCCTGGCACTGMTTTTTATGAGGGCCAGATCACCTCCTTCCTGGGCCACAATG^ 
ATCCTGACCGGGTTGTTCCCCCCGACCTCGGGCACCGCCTACATCCTGGGAAMGACATTC 
CAGMCCTGGGGGTCTGTCCCCAGCATMCGTGCTGTTTGACATGCTGACTGTCGMGAACACATCTG^ 
AMGGGCTCTC'TGAGMGCACGTGMGGCGGAGATGGAGCAGATGGCCCTGGATGTTGGT™ 
AAMCMGCCAGCTGTCAGGTGGMTGCAGAGAMGCTATCTGTGGCCTTGGCCTTTGTCGGGGGATC 

20 GATGMCCCACAGCTGGTGTGGACCCTTACTCCCGCAGGGGMTATGGGAGCTGCTGCTC^ 
■ ATTCTCTCTACACACCACATGGATGAAGCGGACGTCCTGGGGGACAGGATTGCCATCATC 

GGCTCCTCCCTGTTTCTGMGMCCAGCTGGGMCAGGCTACTACCTGACCTTGGTCMGAMGATGTGGAA 
TCCTGCAGAMCAGTAGTAGCACTGTGTCATACCTGAAAAAGGAGGACAGTGTTTCTC 

AGCGACCATGAGAGTGACACGCTGACCATCGATGTCTCTGCTATCTCCMCCTCATCAGGMGCATGTGTCTGMGCCCGGCTC 
25 GTGGMGACATAGGGCATGAGCTGACCTATGTGCTGCCATATGAAGCTGCTMGGAGGGA 

ATTGATGACCGGCTCTCAGACCTGGGCATTTCTAGTTATGGCATCTCAGAGACGACCCTGGMGAAATA 

GMGAGAGTGGGGTGGATGCTGAGACCTCAGATGGTACCTTGCCAGCMGACGAMCAGGCGGGCCTTCG 

TGTCTTCGCCCGTTCACTGMGATGATGCTGCTGATCCAAATGATTC 

AGTGGGATGGATGGCAMGGGTCCTACCAGGTGAAAGGCTGGAMCTTACACAGCMCAGTTTGTGG^ 
30 CTGCTAATTGCCAGACGGAGTCGGAAAGGATTTTTTGCTCAGA 

AGCCTGATCGTGCCACCCTTTGGCMGTACCCCAGCCTGGMCTTCAGCCCTGGATGTACMCGMCAGTACACATTTGTCAGC 

MTGATGCTCCTGAGGACACGGGMCCCTGGAACTCTTAMCGCCCTCACCAAAGACCCTGGCTTCGGGACCC 

GGAAACCCAATCCCAGACACGCCCTGCCAGGCAGGGGAGGAAGAGTGGACCACTGCCCCAGTTCCCCAGACCATCATGGACCTC 

TTCCAGMTGGGAACTGGACAATGCAGMCCCTTCACCTGCATGCCAGTGTAGCAGCGACAAM 
35 TGTCCCCCAGGGGCAGGGGGGCTGCCTCCTCCACAMGAAMCAAMCAGTGCAGATATCCTTCAGGACCTGACAGGMGAMC 

ATTTCGGATTATCTGGTGAAGACGTATGTGCAGATCATAGCCAAM'GCT^ 

GGCGGCTTTTCCCTGGGTGTCAGTMTACTCMGCACTTC 

CACCTAMGCTGGCCMGGACAGTTCTGCAGATCGATTTCTCMC 

MTGTCMGGTGTGGTTCAATMCMGGGCTGGCATGCMTCAGCTCTTTCCTC 
40 MCCTGCAAMGGGAGAGAACCCTAGCCATTATGGMTTACTGCTTTCMTCATC 

GAGGTGGCTCCGATGACCACATCAGTGGATGTCCTTGTGTC^ 

gtcgtattcctgatccaggagcgggtcagcamgcaamcacctgcagttcatcagtc 
tctmttttgtctgggatatgtgcmttacgttgtccctgccacactggtca™ 

TATGTGTCCTCCACCAATCTGCCTGTGCTAGCCCTTCTACTTTTGCTGTATG 
45 TCCTTTGTGTTCMGATCCCCAGCACAGCCTATGTGGTGCTCACCAGCGTGMCCTCTTCATTC 
ACCTTTGTGCTGGAGCTGTOCACCGACMTMGCTGA 

TTTTGCCTGGGACGAGGGCTCATCGACATGGTGAAAMCCAGGCAATGGCTGATG^ 
GTGTCACCATTATCTTGGGACTTGGTGGGACGAMCCTCTTCGCCATGGCCGTGGM 
CTGATCCAGTACAGATTCTTCATCAGGCCCAGACCTGTAAATGCAMGCTATCTCCTCTGM 
50 CGGGAMGACAGAGMTTCTTGATGGTGGAGGCCAGMTGACATOT^ 
CGGAAGCCTGCTGTTGACAGGATTTGCGTGGGCATTCCTCCTGG 
TCATCMCTTTCMGATGTTMCAGGAGATACCACTGTTACCAG^^ 



WO 02/46465 



PCT/GB01/05458 



358 

atccatgmgtacatcagmcatgggctactgccctcagtttc^ 
ttctttgcccttttgagag 

tatggagaaamtatgctggtmctatagtggaggcmcam^ 

gtggtgtttctggatgmcccaccacaggcatggatcccamgcccggcggttcttgtggmttgtg 
5 gaggggagatcagtagtgcttacatctcatagtatc^mgmtgtgmgctcttt 

aggttcaggtgccttggcagtgtccagcatctaaaamtaggt^ 

mcccggacctgmgcctgtccaggatttctttggacttgcatttcct 

cmtaccagcttccatcttcattatcttctctggccaggatattcagcatcctctcccagag 

gactactctgtttctcagacmcacttgaccmgtatttgtgmctttgccmggaccam 
1 0 ctctcattacacaaamccagacagtagtggacgttgcagttctcacatcttttctacaggatga ' 

gtatgaagmtcctgttcatacggggtggctgam^ 

amgagccagmgttgatgtgggmgmgtamctggatactg™ 

acaamttccattacaggggcagtgcctttgtagcctatc 

ctatacctatgtgamctctattatc^mcccmtggacatatgc^t™ 
15 attctcattggggtocmcmtmttcatc 

cattcactmgccatgccatgcccaggagactggtttcccggt^ 

tgccmgtttttcagamgtttgmgcaccatggtgtgt 

•ttgmtatcagttgacagmtggtgccatgcgtggctmcatcctgctttc 

catgcmcaaamtgtgggtgtctctaggcacgggamcttggttccattgttatattgtc 
20 cagggtcatccttatgagactcttamtatacttagatcctggtmgagc^ 
• • gctgctgmgccagggcatgggattaaagagattgtgcgttcamcctagggmgcctgt^ 

aacatggtacactgcatctcmgatgtttatctgacacmgtgtattatttctggctttt,tgmtt 

atggagttgtattttgacaaamtgtttgtactttttm 

gaatggcctctttgtagaaccctgtgk3tatagaggagtatggcca 
25 tcagtcatgactagtgcctagaaagcmtgtgatggtcaggatctcatgacattatat^ 

tactcttmtctcacttcatcmtcamtattttttgagtgtatc 

gagatattaagtctcagtacacttcctgtgccatgttattcagctcactg^^ 

gcccactgtmcaatactgggcagccttttttttttttt^ 

tctamcmtgmttcttcaacagggaamcagctagcttgaamcttgctgaaamcacmcttc 
30 ccttcamtmttggctttgcagatattggataccccattamtctgacagtctcamtttttc 

agaaamtataaamcmcaaatactrccatatggagcatttttcagagttttctm 

catttgtaaaaatactgtttcactmtactta 

aagttcaagtgatctttcaatatcattactaacttcttccac 

cagatttcamttaatctttctatattttttamtttacagaatattatatmcccactgctgaa 
35 ttagaagttamgtcmtattgattttamtatmgtmtgmggcatatttccmtaactagtgata 

tacagtatcttcaaamtacagmtttatagmtmtttctcctcatttmtatttttcaaaatc 

tttactaamtcgtattctaattcttcattatagtamtctatgagcmctcc 

ttttaaaaaatcaamggcactgtgmctattttgmgaamcacmcatttta^ 

agamcmtctatagttatacatcttcattaatactgtgttaccttttaamtagtm 
40 mttgtggtagamtttttaccmctctatactcm 

tcagamttctcaamtacgtgttcaaamtttctgctt^ 

atgagamtacagmgaaaatmtmgccctctatagatamtgcccagcacmttcact 

ctactgtatttcattatctgtactgaaagcamtgctttgtgactattamtc 

AAAAA 
45 295 . 

PPGPERSRLGLGVSLHQRSCPKCIAVFTRVSEPRIQFPASRILPSSNTSKDFDPVSGQSNYGGSQGSGQTLNRPPVASNPVTP 
LHSGPAPRMPLPASQNPATTPMPSSSFLPEANLPPPLMQY1TYPSTASQTNHCPRASSQP-TVSGOT 
SFIKSGPSVPPLWPPLPTTFQPGAPHGPPPAGGPPPVRALTPLTSSYRDVPQPLFNSAVNQEGITSNTMGSMVVH 
GGGLLATPQLTllTOPKMSRSVGYSYPSLPPGYQNTTPPGATGVPPSSLI^PSGPQAF 
50 NLLQERNMLPSTPLKPPVPNLHEDIQKLNCNPELFRCTLTSIPQTQALLNKAKLPLGLLLHPFK^ 

TYINPFVSFLDQRRmC^LCYRVNDVPEEFLYNPLTRWGEPHRRPEVQNATIEFMAPSEYMLRPPQPPVYLFVFDVS 
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GYLNSVCQSLLDNLDLLPGNTRTKIGFITFDSTIHFYGLQESLSQPQMLIVSDIEDWIPMPmLVNLNESKELVQDLLKTLP 
QMFTKTLETQSALGPALQAAFKLMSPTGGRMSVFQTQL^ 

DLFLLSGQYSDLASLGC I SRY SAGSVYYY P SY HHQHNPVQVQKLQKELQRYLTRKIGFEAVMRIRCTKGLSIHTFHGNFFVRST 
DLLSLPNWPDAGYAVQMSVEESLTDTQLV^ . 
5 MTASLSDARDALWAVIDSLSAYRSSVLSNQQPGLW^ ' 
LTTHPSLYRVDNLSDEGALNI SDRTI PQPPILQLSVEKLSRIX3AFLMDAGSVLMLWVGKNCTQNFLSQVLGVQNYASI PQPMTD 
LPELDTPESARIIAFISWLREQRPFFPILYVIRDESPMKANFLQMIEDRTESALSYYEFLLHIQQQVNK . . 

• 296 

cctccaggcccagmcggagccgccttggcctcggggtctccctacaccaacggtcctgtccaaaatgca™ 
10 agagtcagtgagccmggatacmtttccagcttccaggatccta^ 
ggccagtctmctatggtggttctcagc&atc 

cttcatagtggtcctgctccccgaatgccarpacctgcttctcagmcccagctactacaccmtgccttc 

cctgaagccaacctgccaccacctttgaa^ 

tcatcccmccaactgtatctggaaatacmgtttmccaca^ 
15 agcttcataaagtcaggtccttctgtacctcccttagtgmtccacctctgcctacmcttttcmccaggagctcctcatggg 

ccccctccagctggaggcccacccccagtgagggccctcacgcccctgacatcatcatatagagatgtaccccagcccttattt 

mttcagctgtcaaccaagmggtattacatcamtaccmtmcggatctatggtggtcca 

ggaggtggcttattggcaacaccacagcttactmcmgmto ' 

ccacctggttatcagaacacaacaccacctggtgcmctggagtaccaccctcttccttc 
20 tttactcagactcccttaggtgctmtca 

•mtcttcttcaagmagaaacatgcttccgtcmcacctttgmgcctccagttccamtttg 

mctgtmcccagagttatttcgatgcacgctgactagcattc 

GGGCTGCTGCTTCATCCTTTCAAAGACTTAGTGCMTTC * 

ACGTACATCMTCCTTTCGTCAGCTTTCTTGATCAAAGGAGATGGMGTGTAACTTATGTTATC 
25 GMTTCTTGTACMCCCTTTGACCAGAGTTTATGGAGAACCTC ' 

GCTCCTTCAGMTACATGTTACGACCACCTCAGCCTCCAGTGTATCTCTTTGTATTTGATGTC^ 

GGATACTTGMTTCAGTTTGCCAGAGTTTGTTAGACM 

ACATTTGACAGTACMTCCATTTCTACGGTCTTCAGGAMGTCTCTCT 

GTTTTTATACCTATGCCAGAGMCTTATTAGTAMCTTAMTGAAAGTAMGAGCTCGTC 
30 CAMTGTTTACCMGACTCTGGAGACCCAGAGTGCCTTGGGTCCTGCAC 

GGTCGMTGTCTGTCTTTCAMCACAACTCCCMCTCTTGGAGTGGGAGCCCTGAAACC^ 

TCTGCTMGGATATACACATGACACCATCCACTGACTTCTATMGAAATT^ 

GACTTATTCCTTCTCAGTGGACAGTATTCTGATTTGGCTTC 

TATCCCTCTTACCATCATCAGCACMCCCAGTCCMGTACAGAMT^ 
. 35 GGCTTTGAGGCAGTCATGAGGATTCGGTGCACCAAAGGTCTTTCCATTCATAC 

GACTTACTGTCTTTGCCTMCGTCMCCCAGACGCTGGGTATGCAGTAC^ 

TTGGTTTCTTTTCAGTCAGCACTCTTO^ 

GTTTCGACTCTGMTGATGTCTTTCTTGGAGCTGATC 

ATGACTGCCAGTCTGAGTGACGCTCGGGATGCTCTAGTGMTG^ 
40 AGTMCCAGCAGCCTGGACTCATGGTTCCTTTT.TCTTTGCGGCTTTTCC 

TTCCAGACTGGGACAMTGCACGTCTAGATGMCGCATTTTTGC 

CTCACAACTCATCCCAGTTTGTATAGAGTTGACMTCTCTCAGATGAGGG 

CCCCCCATTCTTCAGCTTTCAGTGGAGMGCTGAGCAGAGATG^ 

GTTGGAAAAMTTGTACACAGMTTTTCTCAGCCMGTTCTAGGAGTTCAAMCTATGCATC 
45 CTTCCAGMCTTGATACACCAGMTCTGCCAGMTMTAGCTTTCATCTC 

CTTTATGTAATAAGGGATGAGAGTCCMTGAAAGCAAACTTCCTTCAAAACATC 

TATTATGMTTCCTGTTGCATATACAGCMC^ 

MTAGTGCAGAATACCTGGAMTGTGTMTACCTTCTTTTTCTATTATC 

TGTGATTTCAACAACCTATAAGCAMTAAMGACCACAGCAGAGMTCAMCATGCMCTCTC 
50 AGAATATAGCTACGTATGATTGGATACTTTTTTCTTGCCMTTATCTTTC 

GCAMGTACATTTTGTAAAATAMGATTTCTCTGTTCTACATGTATATT^ 
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CATTTAAMCCTCACAAACCTMGTG^ 

GTAMTATMTGTAGGAAMCTAGMTTCATTCCCCACACGTGTCTTTT^ 
AAMCTAMCTAATTCTAGTATAGCTTGAGAAAMTATC^ 
TGAAGMCAMGTCACTTTGATTTGMGTGAGMCTATCATTAGGTGGTTTTC 
5 CTTMTCTATAGTGAAMGMTTTGAGCTGTCTTCATAMCACTGGGACTAGCMTGATAATAGGGA 
ATCTTGATCCTTTMGTGGATTTTATTTGGTGCATTTCTC 
MGAAATGAAAGGTTTCTAMGTGCTATCCAMTACATCAGTMCATTTTTC 
CATMGAMTATTTATGCTTGMTATGAAMTCTATGAMGCATAMTGCTGCTGTTTC 
CATCCAACATATTAAAGmTGAMGAMCTTGACTTCTGAAAATCCTO 
10 TATAGTCAAMCTATTGAGTGMTTTTGTTTACAMTAGGTAMTTATA 
TTMGAGTTTGGCTCAGTTTTCACAGATTC^^ 
TTTTTGGGTTTTTTGTTTTGTTTTMTGGGTT^ 

CCAGGTTTATATCMTATGAGCTGACTTTMCTGAGTTGTTTGGGATAGGGMGMGCAGTCC^ 

CTTGCCAGCTGGATCAAMTMTCATGTTTTATGAAMTATC 
15 CACATTGMCTGGMGTTTTCTTGAMGCTGCTTCATCTATTMGMGCMT^ 

TTTTAAAGAMCAGTCGTTATATGCTGATGTTTCTTAAMTM 

GTAGGTCCTTTCTTAGTAGTAAAGMTCTTCTAGAGGGAMCATTTC 

CCCTMGTGGGTATGACTCTTGTTATTACCACATGCTTTTTO 

GGTGTAACATGTGCMTTCAGMTMTTTTATMCA 
20 ACATAATGAGAGATGMTGAGCCTTTGGAGATACTGATATMGGCMTTATTTTTTGCMTGT 

TTCTTTTTTTTTCCCCCMTAGGGCACTACCTGCCATATCATCTTGTATTACTO 

TGCCATATTTTTTTTMTTATAGTTGTAMTTATGAAAGATCC™ 

AGGGCMTCTTGGTATTTAAATCTGAGCATGGCAGT^ 

TMCATTTCTGTMGTCTGACATACTMTAGTCACTCMGCAGTACCATTTATTTTAGTTTGCA 
25 TTTMTGTATTGAGTCTMTAGACTGTTTTGCMTMTTAGMTAMGATTT 

TATCTAGGGGACCACCTAMTGTGATTTCAAMTTTTGTTMCTATTACAMTGTMTCCTTATATAGAM 
MGTAGTGTATMTATTGTMTATTAMTTCTTGTTOT 

CTTCTCTGAAMGTTGGAGACMGATTTGTCTTCCTTTTTACAGTTTGTMTTTTCACTC 
AAA 

30 297 

MIMLPLFLIjASWLPSICSHFNPLSLEELGSNTGI 

VNGVGKILKKINKAIVSKKNKDIVTVANAVFVKNASEIEVPFVTRNKDW 

LSPDLIIXJVLTRLVLVNAVYFKGLWKSRFQPE 

MLIAL^ESSTPLSAIIPHISTKTIDSWMSIMVPKRVQVILPKFTAVAQTDLKEPLKVLGITDMFDSSKANFAKITR^ 
35 HILQKAKIEVSEDGTKASAATTAILIARSSPPWFIVDRPFLFFIRHNPTGAVLFMGQINKP 

298 

ATGMCTGGCATCTCCCCCTCTTCCTCTTGGCCTCTGTGACGCTGCCTTCC 
GMCTAGGCTCCMCACGGGGATCCAGGTTTTCAATCAGATTGTGMGTCGAGGCCTCATGAC 
GGGATTGCGTCGGTCCTGGGGATGCTTCAGCTGGGGGCGGACGGCAGGACCMGAAGCAGCTCGCCATGGTGATC 
40 GTAMTGGAGTTGGTAAMTATTAMGMGATCMCMGGCCAT 

GCCGTGTTTGTTMGMTGCCTCTGAAATTGAAGTGCCTTTTGTTACAAGGM 
GTGMCTTTGAGGATCCAGCCTCTGCCTGTGATTCCATC^ 

CTGTCCCCAGATCTTATTGATGGTGTGCTCACCAGACTGGTGCTCGTCMCGCAGTGTATTTC 
TTCCMCCCGAGMCACAAAGAMCGCACTTTCGTGGCAGCCGACGGGAMTC 
45 GTGTTCCGGTGTGGGTCGACMGTGCCCCCMTGATTTATGGTACMCTTC 

ATGCTGATTGCACTGCCGACTGAGAGCTCCACTCCGCTGTCTGCCATCATCCCACACATCAGC 

ATGAGCATCATGGTCCCCMGAGGGTGCAGGTGATCCTGCCCAAGTTCACAGCTGTAGCACAM 

AMGTTCTTGGCAT"TACTGACATGTTTGATTC^ 

CATATCTTGCAAAMGCAAAMTTGMGTCAGTGMGATGGMCCAMGCTTCAGCAGCAACM 
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TCATCGCCTCCCTGGTTTATAGTAGACAGACOT 
CAGATAAACAAACCC 

299 

MSQRPRAPRSALWLLAPPLLRWAPPLLTVLHSDL^ 
5 TDTLEAPGYELQVNGTEGEMEYEEITLERGNSGLGFSIAGGTDNPHIGDDPSIFITKIIPGGAMQDGRLRVNDSILFVN 
REWHSMVEALKEAGSIVRLYWRRKPPAEKVMEIKLIKGPK 
DKILAVNSVGLEDVMHEDAVMLKNTYDVWLKVM^^ 

YSPVAKDLLGEEDIPREPRRIVIHRGSTGLGFNIVGGEIXSEGIFISFIUGGPADLSGELRKGDQIIjSVNGVDLRNASHEQAAI 
ALKNAGQTVTIIAQYKPEEYSRFEAKIHDLREQLMNSSLGSGTASLRSNPKRGFYIRALFDYDKTKDCGFLSQALSFRFGDV^ 
10 VIDASDEEWQARRVHSDSETDDIGFIPSKRRVERREWSRLKAK^ 

RANDDLLSEFPDKFGSCVPHTTRPKREYEIDGRDYHFVSSREKMEKDIQAHKFIEAGQYNSHLYGTSVQSVREVAEQGKHCILD 

VSAMVRRLQMHLHPIAIFIRPRSLENVLEINKRITEEQARKAFD^ 

PYIWPARERL • ' 

15 300 

GGATCCGCGGGACAGATGAGGMGGGGCTTMGTCACTGCAGCCAGAGGGATGGAGGTGGACTGATGGGAGGGCTTCTCCGGTG 
GGGTTAGMGGGAAMGTAGGGAMGAGMGTGTMGGTAGATGGCAGAGGCAGAGACATGGAMGACAGACTCTAGGGTTCCT 
GATGATATCTATCTCGGCCMCACAAMGGGAGGGTACAGTGGTGGGGGCACCCMGCTAGGGTGTGAGTACCCTMGTGTATT 
CTTCTGAGATGTAGGCCATTCACTMCTCTTGGMCAGCTACAGTTTCACAGTAGGMGACCCCCCCAGATTCACTGC 
20 CTOGTAMGCCTCTCAGACCTO 

TGGATTCCTCCCGGGCCTGAGAGGMCTGCAGGMTTCTCCTGCCTCTTACCCGTAAMCCCCMCTTCTCTAGCCCTAG 

GGMGTCCCAMCAATTTCTACCCCTTTTTCTG 

GCTTCTTTTTGAGCTGCTGGAGCCCTCTGTGAGGAGGCC^ 

TATGGGMGAGGGGGCCAGGGTGTGTGGAGCMGATGGTGCGGTGCTGGTGCCTTGGGA 
25 GCTCAGAGACGGCCTACTTTACTCACAGCTGGMTTTAGTGGGGAGMGCAGCTCMCTCCMTCCTGGA 
AMGTGAGAGMGAGAGAGATGTCCCAGAGACCMGAGCTCCCAGGTCAGCCCTCTGGCTO 
GGCACCCCCACTCCTCACAGTGCTGCATAGCGACCTCTTCCAGGCCTTGCTGGACATCCTGGACTAT^ 

agagagtcagamtaccgctaccmgatgmgacacgccccctctggagca^ 
tcccccagtgattgtcmcacagataccctagmgrc^ 
30 cgaggamtcacattggaaaggggtmctcaggtctgggcttcagcatcgcaggtggcactgacaacccacacatcggtgac 
cccatccattttcatcaccmgatcattcctggtggggc 

TGTAMTGMGTGGACGTGCGCGAGGTGACCCACTCAGCGGCGGTGGMGCCCTCAMGAGGCAGGCTCCATC 
TGTCATGCGCCGGMGCCCCCGGCTGAGMGGTCATGGAGATCMGCTCA^ 

AGGGGGCGTAGGGMCCAGCACATCCCAGGAGATMTAGCATCTATGTAACAMGATCATCGAAGGGGGTGCTG^ 
35 TGGGAGGTTGCAGATTGGAGACMGATCCTGGCGGTCMCAGTGTGGG 
CCTGMGMCACGTATGATGTTGTCTACCTAMGGTGGCCM^ 
CATCACMCCTCTTATTCCCAGCACCTGGACMTGAGATCAG 
CCCCACTTCCCCTCGGCGCTACTCTCCAGTGGCCMGGACCTGCTC^ 

GATCCACCGGGGCTCCACGGGCCTGGGCTTCMCATCGTGGGTGGCGAGGACGGTGAAGGCATCTTC 
40 CGGGGGCCCTGCAGACCTCAGTGGGGAGCTGCGGMGGGGGACCAGATCCTGTCGGTC 
CCATGAGCAGGCTGCCATTGCCCTGMGMTGCGGGTCAGACGGTCACGATCATC 
ATTCGAGGCCMGATCCACGACCTTCGGGMCAGCTCATGMCAGCAGC 
CAAMGGGGTTTCTACATCAGGGCCCTGTTTGATTACG 

CTTTGGGGATGTGCTGCATGTCATCGATGCTAGTGATGAGGAGTGGTGGCAGGCACGGCGGGTCCAC 
45 CGACATTGGGTTCATCCCCAGCAMCGGCGGGTTGAGCGACGAGAGTGGTCMGGTTAAA^ 

TGGATCGCAGGGTCGAGMGACTCGGTTCTGAGCTACGAGACAGTGACGCAGATGGMGTC 

CCTTGGGCCCACCMGGACCGCGCCMCGATGATCTTCTCTCCGAGTTCCCCGACM^ 

ACGGCCCMGCGGGAGTATGAGATAGATGGCCGGGATTACCACTTO 

GCACMGTTCATTGAGGCCGGCCAGTACMCAGCCACCTCTATGGGACCAGCGTCCAGTCCGTC 
50 GMGCACTGCATCCTCGATGTCTCGGCCMTGCCGTGCGGCGGCTGCAGGCGGCCCACCTGCACCCCATC 
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CCCCCGCTCCCTGGAGMTGTGCTAGAGATTMCMGCGGATCACAGAGGAGCMGCCCGCAMGCCTTCGACAGAGCCACCAA 
GCTGGAGCAGGAGTTCACAGAGTGCTTCTCAGCCATCGTGGAGGGTGACAGCTTTC 
CATCGAGGACCTCTCAGGCCCCTACATCTGGGTTCGAGCCCGAGAGAGACTCTC 
CTGCCTCCATCACCTGGGCCCTTGGTCTGGACTGMTTGCCCMGCCCTTGGCTCCCC 
5 TTATTTCCTTTCTMCTGGATCCAGCCTGTTGGAGGGGGGACACTCCTC 
MCGCCAGGMCCTGGGGTCTGGGGGGGAGCTGGGCTCCT^ 

CCMTGCACACACAGACCCACCGGGGGCCACCTGCCCTCCCCCATCCTCTCCCACACACATTCCAGMGTCAGGGCCCCCTCG 
GGAGCACCCGCTGCAGGGATGCAGGGCCACAGGCCTCCGCTCTCTCCTAAGGCAGGGTCTGGGGTCACCCCTC 
TTCCCCATGTTACCTTGATTTCTCATTTATTTTTTCCACm 
10 CTCCGCAGCGGGCCCCTGCCTTCCACATGCCCCCACCATmTC 
TTTTTTTTTTTTCCTGGGMTTTTTTTGGGGAAMGG 
GGGGTCGGGGGTCGGGTCTCCGGGAGCCAGGGMGACTGGAMTC^ 
AGAGTGAAAGGAAGAGACAGATACTTGAAAAAAAAAAAAAAAAAAAA 

301 

15 MDRGEQGLLRTDPVPEEGEDVMTISATETLSEEEQEE 

KGWQDVTATSAYKKTSETLSQAGQKASAAFSSVGSVITKM 
SATTTEPLPEKTQESL 

302 

gaggagctctgcgcggcgcgkx?gggcgatccgag£cgggac 
20 tggagacatggaccgcggcgagcmggtctgctgagmcagacccagtccctgaggmggagmgatgtt^ 

tgccacagagaccctctcggaagaggagcaggmgagctmgmgagmcttgcamggtagmgmgamtccagactctgtc 

tcmgtgttagcagcaaaagagaagcatctagcagagatcmgcggamc1tcgmtcaattctcta 

cattgccaaagggtggcmgacgtgacagcmcatctgcmcaagmgacatctgamccttatc 

ctcagctgctttttcgtctgttggctcagtcatcaccaaamgctggmgatgtaaaam 
25 agaamggtcgaaaacttamgtctamgtagggggmccmgcctgctggtggtgattttggag 

amtgctagtgccaccaccacggagcctcttccagaamgacacaggagagcctgtgagattcctacctttgtrc 

ctcccagatgctgcmgcgaggtccmgcacatcttgtcm 

tttacatattcctttgaccamtagtttgtgggttamcaam 

tgtacatttatgttcctttatttaggaaacac(^ttataaamcacttatagtam 
30 agctacagattgtcatagttgttotcctcctttac 

atttcctttttttcccctctttgm 

ttaamgtttgggtgtattttctttaccto 

ataaggtacatattttggttgmgacaccagactg^ 

tgtmtcaaacttctaggtgacttgagagtggmcctcctat^ 
35 attctttattataccacttatatcaacttatttttcacca 

cactgtatgttcagtaggttgmctatgmcactgtcatcmtc 

aaamtgacmtatcaatcacattaggggmccattgttg 

gttatatgactmtataacttgaamttttttatactgaggggttggtgataactc 

mctcatcattttctacttgttttcmtgtgttggam 
40 tgtcagggctgagtgaatcamgtgtctagacatatttgcat 

taccaccttttccttatactgtatatgattatx^c^ 

tttatgccamtgttaactgccmgcttggagtgacctamggatttttt 

tcttatgaamccamttttaaaagccacaggtgttgatottataamtmcatc 

ttagtactttggatgcaattaaaactatc 
45 atatttattatgagaccagtggtctggamcagcttgttgtaccgmtcm 

ttmccatccttaaattattgcttm^ 

ttctcagaggagtggatctgtagmgtctgtctttctatagamtattgtgcttactcm 
ctagtctacttcttaaaattcamc 

GGCAAMTGGAGTAMGTGGCMTTTCTTTAGATGAGTC 
50 GAMTGGGGGTGGCAAGTGGGGTGAGTGGATGAAGGTGGGTATTGGGGGTC 
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TAGACATCTACCTATACTTMTCTAAGAM^ 
GAGGATGGATTTAGAAAAAAACATGMTTTAGA 
TTTAMGMTGAGAAACATAGTMTTAGTAGAMTGCCAGAAACAGTTC 
AAGAGTGTGAAATCTCAGATACTTATGAAATCTCACAGATGT^ 
5 GAMTGTCMTmTGGAGTTTGTGGGAAATAGGGCAAGMTTCTTA 

AGCTCAAMCTGGATTATCACCTTAGCMCTTAGMTAGTTTGAMTAGAAAAAAAGTATTTMT^ 
GATATGCACAGTCTATTTTTTGTATAGTATTGGAAMTAAAMTGCTATMTTTG 

■ 303 

MSWRVSNGSPSLERMDARQAEHPKPSACRNLFGPVDHEELTRDLEKHCRDMEEASQRKWN^ 
10 PEFYYRPPRPPKGACKVPAQESQDVSGSRPMPLIGAPANSEDTHLVDPKTDPSDSQTGLAEQCAGIRKRPATDDSSTQNKRAN 
RTEEWSDGSPNAGSVEQTPKKPGLRRRQT 

304 

ATGTCAAACGTGCGAGTGTCTAACGGGAGCCCTAGCCTGGAGCGGATGGACGCCAGGCAGGCGGAGCACCCCAAGCCCTCGGCC 
TGCAGGAACCTCTTCGGCCCGGTGGACCACGAAGAGTTAACCCGGGACTTGGAGAAGCACTGCAGAGACATGGAAGAGGCGAGC 
15 CAGCGCMGTGGMTTTCGATTTTCAGMTCACAMCCCCTAGAGGGCMGTACGAGTGGCMGAGGTGGAGM 

CCCGAGTTCTACTACAGACCCCCGCGGCCCCCCAAAGGTGCCTGCAAGGTGCCGGCGCAGGAGAGCCAGGATGTCAGCGGGAGC 
CGCCCGGCGGCGCCTTTMTTGGGGCTCCGGCTMCTCTGAGGACACGCATTTGGTC 

CAGACGGGGmGCGGAGCAATGCGCAGGMTMGGMGCGACCTGCMCCGACGATTCTTCTACTCAAMCAAAAGAGCCMC 
AGMCAGMGAAAATGTTTCAGACGGTTCCCCAMTGCCGGTTCTGT^ 
20 CAAACGTAA * . 

305 

MCFSFIMPPAMADILDIWAVDSQIASIX3SIPVDFLLP 

QTAVYEELEDETRRLCDVRPFLPVLKLVTRSCDPGEKLDSKIGVLIGKGLHEFDSLKDPEVNEFRRKMRKFSEEKILSLVGL 
MDVJLKQTYPPEHEPSIPMLEDKLYGGKLIVAVHFENCQDVFSFQVSPNMPIKVNELAIQKRLTIHGKEDEVSPYDW 
• 25 RVEYVFGDHPLIQFQYIRNCVMNRALPHFILVECC^ 

LVKGNKLNTEETVKVHVRAGLFHGTELLCKTIVSSEVSGKNDHIWEPLEFDINICDLPRMARLCFAW 
NPSKYQTIRKAGKVHYPVAWVNTMVFDFKGQLRTGDII^ 

YPPFDKIIEKMEIASSDSANVSSRGGKKFLPVLKEILDRDPLSQLCENEMDLIWTLRQDCREIFPQSLPKLLLSIKWNKLEDV 
AQLQALLQIWPKLPPREALELLDFNYPDQYVREYAVGCLRQMSDEELSQYLLQLVQVLKYEPFLDCALSRFLLERALGNRRIGQ 
30 FLFWHLRSEVHIPAVSVQFGVILEAYCRGSVGHMKVLSKQVEALNKLKTLNSLIKLMVKLNRMGKEAM 
DLQSPLNPCVILSELYVEKCKYMDSKMPLV^VYNNKWGEDSVGVIFKNGDDLRQDMLTLQML^ 
CLATGDRSGLIEWSTSETIADIQLNSSWAAAMFNKDALL^ 
IMVKKTGQLFHIDFGHII^NFKSKFGIKRERVPFim 
TAGLPELTSVKDIQYLKDSLAK^^ 

35 306 

ATGTGCTTCAGTTTCATAATGCCTCCTGCTATGGCAGACATCCTTGACATCTGGGC 
TCCATACCTGTGGATTTCCTTTTGCCCACTGGGATTTATATCCAGTTGGAGGTACCTCG 
CAGATGTTATGGMGCAAGTTCACMTTACCCMTGTTCMCCTCCTTATGGATATTGACTCCTATATC 
CAGACTGCTGTATATGAGGAGCTTGMGATGAMCACGMGACTCTGTC^^ 
40 ACMGMGTTGTGACCCAGGGGAAAAATTAGACTCAAAAATTGGAG 

MGGATCCTGMGTAMTGMTTTCGMGAAAAATGCGCAMTTCAGCGA 

ATGGACTGGCTAAMCAMCATATCCACCAGAGCATGMCCATCCATCCCTGAAMCTTAGMGATAMCTTTATGGG 
CTCATCGTAGCTGTTCATTTTGAAMCTGCCAGGACGTGTTTAGCTTTCMGTGTCTCCTM 
GMTTGGCAATCCAAAAACGTTTGACTATTCATGGGAAGGMGATGMGTTAGCCCCTATGATO 
45 AGAGTAGMTATGTTTTTGGTGATCATCCACTMTO 
TTTATACTTGTGGAATGCTGCAAGATCAA 

TCTMTCTTCCTCTTCCATTACCACCAMGAAMCACGMTTATTTCTC 
TTGGTTMGGGAMTAMCTTMCACAGAGGAAACTGTAAMGTTCATGTCAGGGCTGGTC 
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TGTAAMCCATCGTAAGCTCAGAGGTATCAGGGA^ 
GACTTACCMGMTGGCTCGATTATGTTTTGCTGTTTATC^ 

MTCCCTCTAMTATCAGACCATCAGGAMGCTGGAAMGTGCATTATCCTCTAGC^^ 
AMGGACMTOAGMCTGGAGACATMTATTACACAGCTGGTCTTCATTTCCTGATGMCTC 
5 GGMCTGTTCAMCAAATCCATATACTGAAMTGCAACAGCTTTGCATC 
TACCCTCCCTTCGATMGATTATTGAAMGGCAGCT 
MGTTTCTTCCTGTATTGAMGAAATCTTGGACAGGGATCCCTTGTCTCMC 
TTGCGACMGACTGCCGAGAGATTTTCCCACAATCACTGCCAAMTTACTGCTGTCMTCM 
GCTCAGCTTCAGGCGCTGCTTCAGATTTGGCCTAMCTGCCCCCCCGGGAGGCCCTAGAGCTTC 
10 CAGTACGTTCGAGMTATGCTGTAGGCTGCCTGCGACAGATGAGTGATG^ 
GTGTTAAMTATGAGCCTTTTCTTGATTGTGCCCTCTC 
TTTCTATTTTGGCATCTTAGGTCAGMGTGCACATTCCTGCTGTCTCAGTACAA 

GGMGTGTGGGGCACATGAMGTGCTTTCTMGCAGGTTGAAGCACTCMTMGTTAAAMCTTTAMTAGTTTMTCAMCTC 
MTGCCGTGAAGTTAMCAGAGCCAMGGGMGGAGGCCATGCATACCTGTTTAAMCAGAG 
15 GACCTGCAGTCACCCCTGMCCCATGTGTTATCCTCTCAGMCTCTATGTTGAAMGTGCAAATACATGGA 
CCTTTGTGGCTGGTATACAATMCMGGTATTTG^ 
GATATGTTGACACTCCAMTGTTGCGCTTGATGGAT™ 

TGTTTAGCMCAGGAGATCGCTCTGGCCTCATTGMGTTGTGAGCACCTCTGAAACMTTGCTGACATTCAGCTGM 
AATGTGGCTGCTGCAGCAGCCTTCMCAMGATGCCCTTCTGMC^ 
20 GCCATTGAGGMTTTACACTGTCCTGTGCTGGCTACTGTGTAGCTTCTTATGTCCTTGG 

ATCATGGTCAAAAAAACTGGCCAGCTCTTCCACATTGACTTTGGACATATTCTTGGAMTTTCAMTC 

AGGGAGCGAGTGCCTTTTATTCmCCTATGATTTCATCCATGTCATTCAA 

CGGTOCCGCCAGTGTTCTGAGGATGCATATCTGATTTTACGACGG 

ACTGCAGGGCTTCCTGAACTCACATCAGTCAAAGATATACAGTATCTTAAGGACTCTCTTGCATTAGGGAAGAGTGAAGAAGAA 
25 GCACTCAMCAGTTTAAGCAAAMTTTGATGAGGCGCTCAGGGAMGCTGGACTACTAMGTC 
CGGAAAGACTACAGATCTTAA 

307 

MRAVLDTADIAIVALYFILVMCIGFFAMWKSNRSTVSGYFI^^ 
• MLLLLQLLGWFIPIYIRSGWTMPEYLSKRFGGHRIQVYFAALSLILYIFTKLSVDLYSGALFIQESLG^LWS 
30 TALLTVTGGLVAVIYTOTLQALLMIIGALTL 

NPTDEDVPWPGFILGQTPASVWYWCADQVIVQRVLAAKNIAHAKGSTLMAGFLKLLPMFIIWPGMI^ 
. . MLVCGSRAGCSNIAYPRLVMLVPVGLRGLMMAVMIMLMSDIiDSIFNSASTIFTLDWKLIRKSASSRELMIVGRIFVAFMW 
I SI AWVP 1 1 VEMQGGQMYLYIQEVADYLTP PVAALFLLAI FWKRCNEQGAFYGGMAGFVLGAVRL I LAFAYRAPECDQPDNRPG 
FIKDIHYMWATGLFWVTGLIWIVSLLTPPPTKEQIRTO 
35 ( GKSEDSIKGLQPEDVNLLVTCREEGNPVASLGHSEAETPVDAYSN^ 
SLRDLMEEEAVCLQMLEETRQVKVILNIGLFAVCSLGIFMFVYFSL 

308 

GCGGCCGCCCGCGGGCGGAGAGGGGGCGGCGCGGCGGCAAGGCGCGCGGGCGGGGCGGGGCGCGCCGAGGGGGCGTGGCGCGGC 
GTCTGCGCAGCTGCCAGCGCCTTTMGCCCGGGCTCGCGCTCTCGGACCGTGCTTTCGCCGCCT^ 
• 40 GTTTCTAGGTCCCCACTGTCCCCGCCGTCCCGCCCCTTCGCGTC 

CCAGGCATGCCCCGCTACGAGCTGGCTTTMTCCTGAAAGCCATGCAGCGGGTAAGTGACCTO 
CGC^GGCGCCCCCTGTGCTTGTCATGACCCCTATATTGAGGTTGGCAM 
TAAMTMTTMGGTACGTATATATAGTTTTATGACTGGATATGGCTTTGGGTATGM 
ATTTTTTTGT1TTCTATTATAMTGAGTAGAAAMC 
45 CAMGACTGGCATTGGGAGAOTAGMGGAGCTTTAGAGGCAGATGGGACTGGGTTTTG^ 
TTTMCCTTCTGGATCCTCAGTTCCTTCTTTCT 

GACCTTGGAGGTAGTTAAGTGTAGTTCTGTCATTAMGTAGCTTTGTGGTC 
TTTTCATTATTCCCAGTTTATTMTGMGA 

CTAGATCAMGCTGAAATACAGTTGCTTGTTMGTAMTGTT'TGMTTAGTG 
50 TATTGCCATGAGGGGTGTGGGCCTGGTTTTGCCAGCTTTTT^ 
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gamtct(^ttttttttttttgatgto 
. gttggctctcattttgtmtctagatattcagatm™ 

ccaccttgttmttggtcactggtttgcagtaggactgatttcagcm 
. ttcggattgttgggttgtatatgtgcatttaggctaamtatacatgtaggcttcaga 
5 tgacattcacagtccaottttctttcm^ 

TTGTATTMCCTCTTAGGTGTTTGAMTACTACTGGCATGCAAAGATTGCTTT^ 
TATCTGTGGAATTAAAMTAGTTTACCCATGATCCCTGGAAMTAGTAm 
GATTATAGAGGAGGGAGGTGATGA(^TTTTTATTTTO 
ACTMCCTTCCTTTTTO^ 
1 0 GGTTGTTCTTACGTACGTTTTGGCCCMGGTGTMTTGAC™ 
CCAGGTMGGTMTTTCTGTCCACTOTGGGCATAC^ 
AMGCACAGACTTAGCAAAACTTCTTTATTGGAGCCCTCMTC 
MTTCCTGGTTTTGTCTTTGTTTTGACTTGA 

TTTGTCAGGTGAAGAMCCMGTGCAGTTATGGTAGCCGGGTTGCTGTGGA 
15 GATCAGMGTGCAGTGTTGTGGTOCATTTCAGCTGCTGGGMGATGCTCTTC 

AMGGTGCAGMTCAGTGTAAGTTCCAGATATCTGCTCCCTTTTCATTTTGTC 

AAAACATCTGTTCAGTCTGGTTTTGATO 

CTGTCAGGCATGTTGTAGGAAAGGTGTATGGTACTGMGTGTC 

ATGACCCAGTCATGGGTTGTTTTGATTGCCATTCTTCTGTTGGTC 
20 MCAGTTGTATGMGTATAGTATCTTGMGCTTTMGACACCMGTTTCCATTAGGCAGCCTMGTCTCGCAGACTTTGGTAGC 

ACTGTTGGTCTCATGGGAATGCCATTCTCATGTGTGGACTMGTMGGGCCAGGAGATTTTGCAMTTTTGTGTGTGTAAGAAG 

ATCGMGAAMCATTTTTCCTMTGTCACTGTCTTGAGTATTTTATC 

CMGAmCTTGTCATCTGGCATACAMGAAMCAGTCGTGACCCTATTTCCCTGGAm 

TTTGCMTGCMGAGGAAMTCGAGCTGCCTTTATMTCTAGAATACCATGTTTCTC 
25 CGTMTTTCTACCAGTCCCTCCAMTACTGTTAMGTTACTCTTTTTGTGGTCTTACTTC 

CAMCCTTGGAGAAMTTAGMGGTTTTAMGTTTTCCMC 

CTATTTATMCTTGGCCTGGAGMTTATMTCTGGTCTACCAGGAAGGTC 

mgatgagcagmgcgattccttctagamcagtttttttctagamcataaaa™ 
aaacacamgttgacctggtaggatgttttacctcta 
30 ataatgcctmccttagcmtggattgmgtaggggtgcaggg^ 
cctacatattatacacac^cacatatgtacaattctc 

gmtggaaaaactgaagmttaamcacagacttggtcatacgtaatggtamtacagtagtttmtg 
ctatcactcagcttcmctgttaamccagtctgctcacctctgttttcm 
gctaacatgactaaaaamtttcagacttamgagcagttcatt^ 
35 mtatgccttattttatctttttttmtc 

cttcctcctmccacaacatagtcatcacattcaamcggacattgatacgat^ 

TmCGCTGTTCTTCCAGTAATAGTTm 
TGMCAGTTCCAMGCCTTTGTCATTMTM 
TCACTTTGGAGTTGACTGTAGTTGCGTTTTCATGATTAGAG 
40 GTCMTGCATCACACCGGGAGGCATACCACTTTGTTMGCAGTCATTTGGTTAGGTG 
TGTACCCTATTTCCCTTTTGTATTMGT 

GGTTTTAGCATCTGTTGGTGACTGMTCAGTTATTGCAGGAGTAC^ 
ATTAGTGTATMGAMTMGGACTTTTTTTTTTTTTT^ 
CGTGATCTCGGCTCACTGCMGCTCTGCCTACCGGGTTTCAC 
45 GCGCCCGCCACCGCACCCAGCTMTTTTTTGTAGTTTTAGTAGAGACG^ 
CTGACCTCGTGATTCACCCGCCTCGGCCTCCCAM 
TTTTAMCMCCGTAGTGCCTTTA^^ 

GTGTCTGTCTACAGTTTTGGTTGACTCTGMTCTAMCMGGTCCACAMT^ 
TGTMTAGCTTCTTACCCCACTCTTTACCTTTC 
50 MTTTCTGTAGACCCATAGGTTCTAGAMTTTGATTAGATTCAGGTTTTC 

TTGGGAGGGATAGATCACATCMGAGGGACACATGTCTGCATTTTAGAGAAAMTTMTC 
GCTGCATGCGTCCATTTTGAATCCCCCAGCAGCCTTTCTCCTC 
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ttcagggcttaccggtggtgattttcctcattacattagtcc 
cctagtttgtgtttgtgtgtgtgtgtc 
tttatttcaaamggatgttcatgmctcccttagtgatcctctcam 
tatcatgtctgcctcttggggtttttgttaggatt^ 
5 tggtamtgctagttactmggtmcttctttagttttgtgggm 
cttcatgagttgacatcagggtgcttgggtttctgttg 

cagctgtgtgggaacagttgccagacagacacgcacagggmggtctgcacatatcactgcaca€tm 
gttgcagcagtaamcttgamcacgttccgcagtggagttagcaca 
gttgagggagatggtggctccagatacaggtttttac 
. 1 0 agtccagtttgccttgtcattacacataccctcmctggagctatttgagattgcaga 
tttttccctttagcctctttmtctgcatctcttgttg 
cagtc™atcxk:gmgmcttagttgttmtatttgatttttto 
tgttctggacactaggacacatcagggmcmcacagaamttctaccctcatggagtttacata 
aagaaacgtttaagttttatagcgtgttaggtgttgagtactatggatam 
1 5 gggaagttacttttgtgcaaagacgtaamggagaggagacaggtcttc 

. gamtagccagtcccagaggctagtttggtccattgtgatgamgggttgactgctto 

AGGGCAGMGCATGGAGACMGAGGCTGGGACAGTGATCTGGATGAGATGGMTGGTGATTTC 
CTTTGATTCTGGATATATTTTMGTATCGCCMCAGGCTTTGCTGATGGATTGA 
GTCAAGGATGACTAAMTTTTTGGACTGAGCMTAGTGAGGATGAAGTO 
. 20 CATTTTTTGGGCMGAGCAAATTATTCATACCCCGGMCAACAGGCM 
MTTTTACTGTGAGMCCCTTGCATTTGTAGATTGTACAOT 
AGATTGGCTTAGATAGMTCTATAGCTGTGATGACTGGGTCMGTATAGM 
TGGMTGACTMTTMCACTGATAAAMGMTGMGGCAGAMGCGTTG^ 
TACAGCTGACCCATCAGGGCTTTGCTGTATGTGMGCCTATG 
25 ACACATCCATTCACATGATCACACAMGAmCMCTGGTTTTTATCTTACGTGCTG 
TTACATATCCATCTTGTGTATTTTCMGCACMCTTTTO 

CAGTMTGTCCTTTTCACTGAAGGTAGTATTGCATAGTGATTMGGGCAAMGCTGTTTA 
CTGGCTGTMCACAGACAGCTGTGGCCTGGGGTGAGTTATACTACCTC^ 
TMTTGTACTTATCTCATGATTGTTATMC&ATT^ 
30 AGGTTAGTGTMGATTTTTTTTTTTTTTTAAAm 

ATAGTGTTGAGCCTCTTGMTTCTTATTGAGTTACACAMTGTT^ 
TTTCATATTTCCTTTTCTTTTTGAGACAGTCTCGCTC 

CTCCACCTCTCTGGGTTCMGCGATCCTCTTGCCACAGCCTCCTGAGTAGCTGGGATTACAGGCGTGTGCTAC 

GATTTTTGTGTTTTI^TAGAGATGAGGm 
35 CAMGTGCTGGGATTACAGGTGTGAGTCACCACGCCCAGCGTGTTATTTCATGTTTO 

GGGAGTTTGCTCCTATATGTGATCTAGTATCATTCACATAAMGTTAAMCCAGTAGTAGTTTCA 

GACTGCCATATCTACMGGTCTTATTTCTCTGAGATTTCTATAAAAAMTTC 

TAAMTGMTATATCTTGGGCGGAGTGTATATGGATGMG 

TTTGAGGAMTGGTTTACTTTGGAMTTATCCATTCTTMCCCCTCCCCACTC 
40 TGTTCTGGTGGTAGAGGGTTGGMGGAGGGCCGCAGTCCCTCCCTACTCTGC 

ACCATTTGAGTCCMTCAGGGTTTTGTATMTTAATMCTm 

AMGATTTMGGGCAMTTCTTTATATTTTAGGGATGGTTAAMCATTACGGTACAGG 

GGGACCAGMGTGTTTTGGATTGTGGGATATTGCATATTACTTGGTGGCTG 

TGCTCCAGTGAGCATTTCCTTTGAGCATCATGTCAGTTCTC 
45 TTGCATTAGGGATACTTAATCTGTAGTGTTGTCTTTAGCTCCACTTACMTTTA 

AGGAAAACCTGTTTTGCTCTTTGTGTTACTGATGMC 

GACAGTCCTATGTGAGTACACATACACTCTGCATCAAGCGTTAGTAGTCTATTCCACCACAGGAATT^ 
AGTCMGGTGACAAMGTGAMGCATGTATGTAMGTCTTMGMTTTTATATACA 
GATTTMGTAGGAGCTMTAGTTGATGAGACAGTTGGGAGTCTAAMTGTTTO 
50 CTAATCTTTCTATATATAMTMCTTGGGA 

TCACTOTAGTATATTCAGGTTGTAGGGAGTAMC^ACATTTACTGAGCCAC 
TCTGTTACTTATTCCTTATAMTCATAAGGAGCAAGTTGGTTT 
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GAGATGMGTTGTTGCCTGAGATCCTACAGTTAGGMGTC 

tgtaccgtgccacctacatagatgtggtacctacacccacctcccttcm 
gaamcmgcttgggtatgcccatgttttttgtttto 
ggtgtctgtgtgttctatmcccttcmttat^^ 
5 tcmgctoccattgacagagtgctggactaggcgtctcagattgtaccttgagtgtgt^ 
amgaggtggttgtttggtcacgtttmgattattgcgtactactctto 
gmgccctgagtataggagataj^tttttcttg 
tagagtgcccatatatcctgctgtttacatatagttccagmtaagtto 
agtactagagagatctggtcatgatmctmttacattttggttmcatggttatatcacctgaagga^ 
10 ggtcttacctttttttctctttct^ 

ttgcmtttmtttatccagatgcgtggmtagm 

agmggatgccttctcttttgggmgaaggggmtctgctctgtttcamgcacctttgtgttcmg 
catcaccactaggggactgtttctttttgc^ 
cmggmcccaectacatgcttttcataj\ctatacctattttattgtcttatm 
15 a(^tgacaaamgcatatataaatataaaaatacmgttaaamgcctcatgttag 
aaamtmgggaaccctggacagtgctcattgtattttgttgmgam 
atatatatmcatactgtatattcagmccatt^ 

atgactagagtgtaamtatagctgcattgcattatagtattttatatatttgtacamccamttc 
tttttccttttccagtctttacca 
20 tggtatggtatccagtgagaagggmctgtccgcagtggctttgmggatatgaggaagcctaggcggagagto 
ccttgagctagtgttagggtactatttggtacctctcctc 
atgagmcttcatgcagtttgagcattcatcaggcagcagmtatagcttctatatto 
gmgmcmtgttactgatagtaacagtgcttcccagtgattgag 

camtaaggtgcagcagctagtatgacacatmcccaccgtatttcccccccmcggccagatgaaaccagaaatacccactgt 
25 gtttttctctcccttctttc^ 

aaattgamcaagtggcctgaggatgacttggatggctgttgggtagcagagagccctggaggcaggg 

agcagttttcattgtmttgacacagcagatttcattgtmttgacmg^ 

tamtmtgtttactagtcatctgattagattttattaaamtttagatgattcgtcctacataatgtc 

amcttggmgcttttgggggtggggmggtggtcatagamtatgatcatct^^ 
30 cctcmcttttmgatttttot 

gatttamtcacttttcatagcatmttattgtatt 

caggcagttttttctgcattagacctaagaacctmttttgtagggcag 

gtgagmgtgtgttgtamgtttctmtactgtggtctgttttgaacggttttg 

acttaamggaamtgcctagaatcatcagaaamgcatggaaamcattcttctctgaggttc 
35 /rcatttagaagttmtgtgtggaaggcattaggttgmtaam 

tgaamtttagtatttgtctctttttcaggagagmg 

tttgcagagtgattatcttmtccagmctgaaamttcctgatggtcctac 

ttttgttgaggtttggcggacataatmtamtcattttttagtc 

cttatacccttcmcaaamgcattaamggaamcataamtcacgcctttagtcagtgccg™ 
40 tggamtactgcctgatgaamggmgagmggamto 

tggcagamtmtacmtttttgagttgttttgtmgttgg 

agggggcgtctcacmgatctaggttactgtatgacmctatatccatttactggagtttmgm 

ccagagaotaggcagtacttggttttctgtatttamttttc 

ccatccaggaccctmctggccatamgttacccccamctctcct^ 
45 tattacgmtataattagtgttttctcccctttag 

gacmgamgggcmtgctgtggtaggcagttagctgctgmgttaggccacttagttatgagg 

gctcaamtgatmtattctcactgacaamgtatcagctgtgatcatatagattgctctttcctc 

aaaamttttttttcttccmtamgmtgtctcgttgcgcc 

tcctctacttcaatctttgtcttttc 
50 aatitctcctccagcacctgagtamtgctgatggtcttgtggaga 

mgmgcggaamtttamctgtcttcttc^ 

gaccctgctgtgttgctctgtgtagtccagtc 
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GACACTTCTGTCATTGGAGCGCTATTATTCACMGTTACCAGAATGAGAGCT^ 
CCTGTATTTTATCCTGGTCATGTGCATTGGTTTTTTTGCCATC 
GGGGCGCTCTATGACCTGGGTMCMTTGGTGCCTCTCTGTTTGTGAGCMTATTG 
ATCTGGAGCTGCMGTGGATTTGCAGTGGGCGCATGGGMTTC 
5 CCCAATTTACATCCGGTCAGGGGTATATACCATGCCTGMTACTTC 
TGCAGCCTTGTCTCTGATTCTCTATATTTTCACCMGC 
GGGTTGGMTCTTTATGTGTCTGTCATCCTGCTCATT^ 

CTACACAGACACTCTGCAGGCTCTGCTCATGATCATTGGGGCACTTACACTTATC^ 
TGAGGMGTTMGAGMGGTACATGTTGGCCTCACCCGATGTCACTTCCATCmTTGACATACM 
10 TTGTMTGTCTCCCCTMGAAAGMGCCCTGAAMTGCTGCGGM 
TGGGCAGACCCCAGCTTCAGTATGGTACTGGTGTG^^^ 

TGCCAMGGCTCTACTCTTATGGCTGGCTTCTTAMGCTCCTGCCMTGTTTATCATAGTTGrc 
ACTGTTTACTGATGATATAGCTTGCATCAACCGAGAGCACTGCATGCTG^ 
TTACCCACGCCTGGTGATGMGCTGGTTCCTGTGGGCCTTCGGGGTTTMTGATGGCAGTCATC 
1 5 CTTAGACTCTATCTTTMCAGTGCCAGTACCATATTCACCCTCGATGTC - 
GTTMTGATTGTGGGGAGGATATTTGTGGCATTTATGGTGGTGATCA^ 
AGGCCAGATGTACCTTTACATTCAGGAGGTAGCAGATTACCTGACACCC 

GMGCGCTGCAATGMCMGGGGCTTTCTATGGTGGAATGGCTGGCTTTGTTCTTGGAGCAGTCCG 

CTACCGTGCCCCAGAATGTGACCAACCTGATAATAGGCCGGGCTTCATCAAAGACATCCATTATATGTATGTGGCCACAGGATT 
20 GTTTTGGGTCACGGGACTCATTACTGTMTTGTGAGCCTTCTCACACCACCTCC^^ 

TTGGTCTAAGAAGAACCTGGTGGTGAAGGAGAACTGCTCCCCAAAAGAGGAACCATACCAAATGCAAGAAAAGAGCATTCTGAG 
ATGCAGTGAGMTMTGAGACCATCMCCACATCATTCrc^ 

TGTTAATCTGTTGGTAACCTGCAGAGAGGAGGGCMCCCAGTGGCATCCTTAGGTCAT^ 

TTACTCCMTGGGCMGCAGCTCTCATGGGTGAGAMGAGAGAAAGA^^ ' 
25 AGACTGGTTTTGTGGCTTTAAMGTMGAGCCTCAGC 

GATGCTAGMGAGACTCGGCMGTTAAAGTMTACTAMTATTGGACTTTTTGCTGTGTGTTCAC 

TTATTTCTCCTTATGAACTTMGGATATGGTGAGA 

ACTGTGCATCTCTCAGGCATTGTTTACGCTGTAGGTTOTAGCCAMTCT 

TTTTCCCAGAGATGGATTAMGTAMTCTTCMCTTM 
30 TTTTGCATACCAMGTAAGAAGAGACCMTTATTCTCACAGAGCACTTAGAGCAGMTATATGTTMGTTACC 

TATACTGTCTGCACTGCCMGTCITCGCAGACCTTACCCTGMGTAGMGATTTGCTCATTTCTAM 

GTMTCCCTCCTACCATTAAGAAAMCmTTTCTTAGACATTGTACMTCAGTTATGTACTGAAMTCG 

ATACTTGTTTCAGGACMGTTCATTO 

TTTCCTGTAGGTATTTTTGTACCACCAGTATATGGMTGTTAGGGAAAMCTTTGTTCCA 
35 TTTCMGTTTMGTGAACCATACTGAMTGACCMCAAGTCTGCCTGTAAAGTTACATGTCATGA 

TGGGGGAGAAMTGMGTAMTGTTGCTGATGATCCCCATAm * 

C^GCCCCCTMGCAGTGTTTGATTMCTTATGCTMTCAGATGATTACTCATATAT^ 

TATTTGGAAAMTTATTAGCCAMTGCCTTCCTAGGTGGATCCAGTTGGMGATATGTCCAG 

TGCCTTTGTGTGCTGGATTGCTCTACTTGATTAGATC 
40 TCTTATTGCATCCTTGATMGTTTTTCCCTGATTTTT 

TGTAGTGACTTAGAGCATMGGATGTTTCAGTGCCM^ 

AGGTTGTCMTATAGTCTTTCTGTAGGATGGATAGCATGTCT^ 

TGTGGAGGGTATTACAGGACTGTGTMTTATAGGACTCTMCTTGACATGGCTTGGCA 
. TACAAMGATGTTAGAGAAMGCTCTACAGATTACGTACTTCTC 
45 AGATGMGGMGCAAATTATGTATGTACTTTCTTTC^ 

TTGTATTTAGGGAAATTACTTTCATAAATACTTTTC 

MCTAGTTTCATTATGATGGACTTGATTAGTCCAMGTTMTTTTAGA 

AGGCTTTCTGAAGGACTGAGTCTGTAMTGAAAAMTMTTTATGTATGMTAGCATG 

GTAGMTTTTTCTCAATTTTATTCTTGAGGTTTATMTTO 
50 GAGGCTGCGACATGTCCMGGTTATGMGTCTCTTTO 

AGATGTGTCTGGACAMTGGTTGTCMTGTTTTGTCCTC 

TTGGGAAAGTMTMGATCTTGGATTTTTCAMTTMCATTMGTTGTMGMCTA 
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AATTCACTGATMTTGACATATTGGCTGGGCAGCC 
TTTTTCTCCCTTTATTTAACATTGAGTC 

GATACATAAGGTACATCATCTTGTGTCTGTGTGTATATAGCAGTAGGTCMGTTTAG 
ACTATTAGCATATTCATTAGMTTGTTTATO 
5 TCTTATTGCTTTCCAGTAMTAGATMTGTGCTC^ 
MTCTATATTTCATTAGMTGATTGTCCCTG 

GACTAAATGGGTCCTAMTGATGACATTGGTCTTTAGACATTMCATGTGTATATTTTTA 
ATTGMGCTTGATATTGACTAGMTAGCTAAAAGTCAAMTGAGGTGAGGACACTGGTCTTC 
CTTACCAGGTGTTAATGGTATCCCCAGTTCTTAGACTTTTGTCTTCTCAGGCM 
1 0 TATTACATTGGTATGCAGGATGATTATTGCATA'ITITGTGGGACCTC 

CCACCCCCAGAGAGTAMCACTTGAGCCGATTTCTTCTTCCCCAGCTATTCTTTCCTGG^ 

TAGAGMGCATCMGCMTAGCMTGGTGCTGTGTCCTTCGGCCTAMTTCMTO 

TTGGCTCAMGGATCCATTGTATTTTGGCACAMGAGCCTGGCCAGGGTCATGTAGC 

AMTTAGCTC^GACCCATCACTCTGTGAMCTTCACATTTTMGMCTGA - 

15 ggctmgcmggggttmctcttgtgagagccmtagagtgtgtctgtact 
atatggatcttcccctctgactttgmtatcatt^ 
tattagctactcagttacttgctactcamggttaggtct^ 
atgatgattctcctggctcattttcatcagag 

TTTCCTAGGTTTGGATAGAGAGATGTATACMGACCTTTCCTGTTAAATTACGTGAC 

20 ctmgamtcagaaamtgattmgtgagataagtacctgggtgacacagatattagcccgttggtaaaagacaacaaatatta 

gcttaamtctgcatatgtagaatcattttcattagatttagagcttc 

catatamtcagggcaccmtamtmgtttcagctttttamccctggtttgatgttm 

accacagtagagatmtttagtagaaamtgctttgaggcttcagtatttgtmgatm 

gmggcatttcagtgttgatmtagcctgagcagacttctttacamtggg 
25 attcagagagactggtctttctctttgtctcccttcac 

tcamtttggattttttggtcagcttagttcactttmggcatattggcatggtgtgtg 

aggacttttagatccamtmtgactcattamtatmttatgttttmgtatactgm 

ctcaggmtgatttcctcacacmgtgttggctmtmtaamgcactgttttatcctcaam 

gagagagcagtagtgatcatttatgtgagcccctttgam 
30 amggtttttttgcagtaamgtaaamtttggmttagttggcatatamggmcc 

attgtgaamggtmtctttcgattgctagacttggttaacttagggctc 

ggmtgtttttgtcagactgtcctttgttggaatacm 

tcamgtmttmctggctttgccagtggtgagtc^ 

ttcmtgctgagaatmggctttaaattactgattc 
35 ccatctamcttctttataaaaagagggattagtttttttgttttggggtmgcacctm 

accattggcatatatagtctttcactcagaaatamcaaamctc 

ctttttgggcagtaccatacatagtctgaggctattgacttamc 

acctgggttcacagcttgcttgmgagamggatgctagmtamgtmgcagctgaagagcgag^ 

gtggtctcagatttttcgtagtgtgggmcagtggttttgctctataccactgaa^ 
40 tactgaagcttttcccctcacttctac^ttgtttacattcagagctctatcmtm 

mccgcacmgttggcagtaggtatccccmcctmtttatcttg 

gtgtotacttttoattgctcttagacagagtagtct 

tgttcattagtaactcatctttttgttgttatmttggamcagaaacga 

mggccagtatatatggtattccataatatmccagcttttgamtttatgtg 
45 gactctgctagtttgcacctttccgttcttm 

mgtaaaaaaaaaaaaaaamcatgcattacattgacatacmatgt^ 

ctgtatttgtgttgtagcctaaatgttgtttcttttatatccattaaam 

gttgtgtttccattgtaggttgataggtatatcgagmcaggtacgtgacmca 

catagamgcacaamtcgtgttcacacattagtgtacccacacatagamgcacmgactaatagtal^ 
50 agtgccagtcatamggccaccaggtatttgtctcagagttgctatgagcactacag 

tctaaactggtcctmtggtmgggacccamggmtmtctcm 

gtcmgmtgccaamttatatttgggggttactagctaamtggggttoa^ 
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TAGGGACAGATGTCTAGGTTTTTCGTGGGTGG 
AAAAATCTTAAATCWTTAGGAAATATTACCTCTTAACAGTGCCCC 
ACTACTTGTCTTTGATTTTTTTTGTGTACGTTTGTATGTGAG 
TCATTTTAATAAAGGATAATTTGATCTGA 

5 309 

MSLQRLLQHSSNGNLADFCAGPAYSSYSTLTGSLTMNDNR^ 

NETFGFEIQSYRPQNQNACSSEMFTLICKIQEDSPAHCAGLQAGDVLANINGVSTEGFTYKQWDLIRSSGNLLTIETLNGTMI 
LKRTELEAKLQVLKQTLKQKWVEYRSLQLQEHRLLHGDAMCPSLENMDLDELSLFGPLPGPGPALVDRNRLSSESSCKSWLSS 
MI^DSEDGYQTCVSEDSSRGAFSRQTSTDDECFIPKEGDDFLRRSSSRRNRSISNTSSGSMSPLWEGNLSSMFGTLPRKSRKGS 
10 VRKQLLKFIPGIiHRAVEEEESRF 

310 

GCACGAGGCGAGCTGGAGTGATCTTTATTCACMTGTCTTTACAMGGCTC 

TTCTGCGCTGGGCCAGCGTATAGCTCmCTCCACACTCACCGGCAGCCTTACGATGMCGATAATAGM 

GCAGACACGGTGGCTACTCTGCCTCGGGGACGAMGCAGCTTGCTTTGACCAGATCMGTTCOT 
15 CAMGAMGCmTTACTGTGGAGMGCAGGATAATGAMCATTTGGATTTGAM 

GCCTGCTCCTCGGAMTGTTCACTTTGATATGCAAMTACAGGAGGACAGCCCAGCTCACTC 

GTCCTTGCAMTATCMTGGTGTGAGCACAGAAGGTTTTACCTACAMCMGTC 

CTAACGATAGAGACTCTTMTGGMCMTGATTCTGAAMGMCGGA 

AMCAAAMTGGGTGGAGTACAGATCTCTGCAGTTACAGGMCATCGTCT 
20 GAAMCATGGACTTGGATGMTTGTCTTTGTTTGGACCCCTGCCTGGGCCAG^ 

AGTGAGAGCAGCTGTMGAGCTGGCTGAGCTCCATGACGATGGACAGTGAAGATGGCTACCAGACGTGTGTGTCTGAGG 

AGCAGGGGTGCCTTCAGTCGGCAGACGAGTACAGATGATGAGTGCT 

TCTTCMGGAGGMCCGGAGCATCAGTMCACCAGCAGCGGATCCATGTCTCCCTTGTGGGAGGGCMCTTATCMG 
GGGACCCTGCCCCGGMGAGCAGAMGGGMGTGTCCGAMGCMCTCTTGAMTO^ 
25 GAGGMGAMGTCGCTTTTGACGGATTGTGGTGTCCTTTCAAATTAGCTTATT^ 

AGCTTGGTGGGAMGTGCAGMGMTTGCAAMCTGACATCCCATTTCACAGCMTAGTC^ 
TAGTTTATGCTTCTTAMTCATTTTTCMC 

TMCAAAGATGCAGGTTCCTGCTCATTCCAGTMTGTCTTTGAMGCAAMCTAATATTTATTTTCTAGATTATCCCTO 
MTTGAGMCTTTTTGGAGTCMGTATGMTAMGGTGTGGCAGMTATMTMTCTGGACTATTTTC 
30 GGTTATAAMTCTTAGGTTTGCTTATGCCCAGTAGCTC^ 

MTGGCTTGTAMCAMGATGACCATCAGCTGTTTCTCACATCT^^ 

GGGTTMGGTTCATGAGMCCATGGAMGATGTGGTCTGAGATGGGTGCTGGAMGATC 

TCTAAACAAAAAAAAAAAAAAAAAAAAAAA 

311 

35 MEQQDQSMKEGRLTLVLALATLIMFGSSFQYGYWMVNSPALLMQQFYNETYYGRTGEFMEDFPLTLLW 

GSLLVGPLVNKFGRKGALLFMFSIVPAILMGCSRVATSFE 
. FIWGILVAQIFGLRI^LANVDGWPILLGLTGVPAALQ 

EDEAEKMGFISVLKLFRMRSLRWQLLSIIVLMGGQQLSGVNAIYY^ 

ELLGRRLLLLLGFSICLIACCVLTMLALQDW 
40 LSNFWGLIFPFIQEGLGPYSFIVFAVICLLTTIYIFLIVPETKAKTFIEINQIFTKMNKVSEVYPEKEELKELPPVTSEQ 

312 ' 

CTTCTCTCTCCATTCAGTGCACGCGTTACTTTGGCTAAMGGAGGTGAGCGGCACTC 
CAGGATCAGAGCATGMGGMGGGAGGCTGACGCTTGTGCTTGCCCTGGC 
TATGGGTACM03TGGCTGCTGTCMCTCCCCAGCACTGCTCATGCMCMTTTO 
45 GMTTCATGGMGACTTCCCCTTGACGTTGCTGTGGTCTGTMCCGTGTCCATCTTT 
CTGGTCGGCCCCTTGGTGMTAMTTTGGCAGAAMGGGG 
ATGGGATGCAGCAGAGTCGCCACATCATTTGAGCTTATCATTATTTCCA 
AACGTGGTCCCCATGTACTTAGGGGAGCTGGCCCCT^ 
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GTTGGCATCCTTGTGGCCCAGATCTTTGG 

CX3GGTCCCCGCGGCGCTGCAGCTCCTTCTGCTGCCCTTOTCCCCGAGAGCCCCAG 
GCGGCCGCCAAGAMGCCCTACAGACGCTGCGCGGCTGGGACTCTG^ 
GCAGAGMGGCCGCGGGCTTCATCTCCGTGCTGMGCTGTTCCGGA 
5 CTCATGGGCGGCCAGCAGCTGTCGGGCGTCMCGCTATCTACTACTACGCGGACCAGATCTACCTGAGCGCCGGCGTGCCGGAG 
GAGCACGTGCAGTACGTGACGGCCGGCACCGGGGCCGTGMCGTGGTCATGACCTTCTGCGCCGTGTTC 
GGTCGGAGGCTGCTGCTGCTGCTGGGCTTCTCCATCTGCC 

ACAGTGTCCTGGATGCCATACATCAGCATCGTCTGTGTCATCTCCTACGTCATAGGACA 
GCGCTGCTCATCACTGAGATCTTCCTGCAGTCCTCTCGGCCATCTGCCTTCATGGTC 
' 1 0 TTCACCGTGGGCTTGATCTTCCCGTTCATCCAGGAGGGCCTC 

ACCACCATCTACATCTTCTTGATTGTCCCGGAGACCMGGCCMGACGT 

MGGTGTCTGMGTGTACCCGGAAMGGAGGMCTGAMGAGCTTCCACCTGTCACTTC 

CAGTGGAGCTGGTCTGCCAGGGGCTTCCCACTTTGGC™ 

ACTCTGATGTGGMTGCAGTCCTCATCTCCAGCCTCCCCACCCCAGTGGGMCTGTGCAMGGGCTC 
15 CTGGGCTGTCTCTCTCCATGTTGGCCTGTCACCAG 

GTGTGGCTCCTGGTMCGTGGCTCCACCTTGATGGGTCAACCTTTGTC 

TGGTGAGATGGMGGMTCAMTTTTGCCAGAGAMCTMCTCGGTGGCCCCMC 

AGAGCCAMTTCATCCTCTTACCAGATCCTTTTCCAGAAATACCTGTCTAGGMGGTGTGATGTCAG 

MGCTGAGGAACAGGTTCCTGTGGAGACACTGAGltAGMTTCTTCATCCAMm 
20 TGTGTAGTCAATAAMTGAACCTGATCACTTTTC 

313 

MASVAVDPQPSWTRVVNLPLVSSTYD^^ 

KGLDRIEERLPILNQPSTQIVANMGAVTGAKDAOTTTVTGAKDSVASTITGVMDKT^^ 
MQLVSSGVENALTKSELLVEQYLPLTEEELEKEAKKVEGFDLVQKPSYYWIX5SLSTKLHSRAYQ 
25 LHSTVHLIEFARKNVYSMQKIQDAQDKLYLSWEWKRSIGYDDTDESHCAEQFESRTLAIAI^ 
QNIQDQAKHMGVMAGDIYSWRNMSFKEVSDSLLTSS^ 
EMDKSSQETQRSEHKTH 

'313 

MASVAVDPQPSWTRVVNLPLVSSTYDLMSSAYL^^ 
30 KGLDRI EERLPILNQPSTQIVMAKGAVTGAKDAVTCT SGS INTVLGSRM 

MQLVSSGVENALTKSELLVEQYLPLTEEELEKEMKVEGFDLVQKPSYYVRLGSLSTKLHSRAYQQALSRVKEAKQKSQQTISQ 
LHSTVHLIEFARKNWSANQKIQDAQDKLYLSWVEWKRSIGYDDTDESHCAEQFESRTLAIARNLTQ 
.QNIQDQAKHMGWMAGDIY^ 
EMDKSSQETQRSEHKTH 

35 314 

ATGGCATCCGTTGCAGTTGATCCACMCCGAGTGTGGTGACTCGGGTGGTCMCCTC 
ATGTCCTCAGCCTATCTCAGTACAMGGACCAGTATCCCTACCTGMGTC 

atcacctccgtggccatgaccagtgctctgcccatcatccagmgctagagccgcamttgcagto 
mggggctagacaggattgaggagagactgcctattctgmtcagccatcaactcagattgttgccmtgccam 
40 actggggcaamgatgctgtgacgactactgtgactggggccmg 

accamggggcagtgactggcagtgtggagmgaccmgtctgtggtcagtggcagcattmcacagtcttggg 
atgcagctcgtgagcagtggcgtagaamtgcactcaccamtcagagctgttgg™ 

GMCTAGAAAMGMGCAAAAAMGTTGAAGGATTTGATCTGGTTCAGMGCCMGT^ 
AC(^GCTTCACTCCCGTGCCTACCAGCAGGCTCTCAGCAGGGTTAMGMGCTMGCAAAAMGCCM 
45 CTCCATTCTACTGTTCACCTGATTGMTTTGCCAGGAAGMTGTGTA 

CTCTACCTCTCATGGGTAGAGTGGAAMGGAGCATTGGATATGATGATACTGATGAGTCC 
CGTACTCTTGCMTTGCCCGCMCCTGACTCAGCAGCTCCAGACCACGTGCCACACCCTCCTGTCCM 
CAGMCATCCMGATCAAGCCMGCACATGGGGGTGATGGCAGGCGACATCTACTCAGTGTTCCGCMTGC 
GMGTGTCTGACAGCCTCCTCACTTCTAGCMGGGGCAGCTGCAGAAAA 
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GTTMCMCACGCCCCTCMCTGGCTGGTAG6TCCCTTTTATCCTCAGCTGACTGAGTC 
GAGATGGACAAGAGCAGCCAGGAGACCCAGCGATCTGAGCATAAAACTCATTAA 

315 

MCLRGGCSPRAPAMPQPRPPPALPPRPRAPVPASRPGRPLLTPARPCGRMI^GSPGPRLGGSRGERRRPAGIU3PARVGPG 
5 RRPARPGPMWTETGQGI VHALTDLSI PGMTSGNGNSASSIAGTAPQNGENKPPQAIVKPQI LTHVIEGFVIQEGADVSRVJDAR 
LIjVGNLKKKYAQGFLPEKLPQQDHTTTTDSEMEEPYLQESKEEGAPLKLKCELCGRVDFAYKPKR 
• RVGLFHSDRSKLQKAGAATHNRRRPAKPVCHHLPRIPRSSQQALCPFRLLLLCVTHSQEDSSRCSDNSSYEEPLSPISASSSTS 
AGDKASGTWSSPTCICGTWWAWDTTSCQVSHQVOT 
LKIYARISMLKDS 

10 316 

ggcgccgcatgtgtctccgcggcggctgcagccctcgagcgcccgccgccgcgccccaaccccggccgccgcccgccctcccgc 
cccggcctcgcgcccccgtcccggcctcgcgccccggccgcccttottgacgccggccagg^ 
gcagccccgggccccggctcggaggctcccggggcgagaggag^ 
aggggctgcgtaggcccgcccggccaggcccagccgcctggacagagacagggcac^^ 

15 gcatccccggcatgacctcagggmcggaaactctgcctccagcatcgccggcactgccccccagmtggtgagmta 
cacaggccattgtgaaaccccamtcctgacgcatgttatcgmgggtttgtgatccaggagg 
acgctcgtctgctggtggggmtctcaagmgmgtatgcacaggggttcctgcc 
ccaccactgactcggagatggaggagccctatctgcmgmtccamgaggagggt^ 
- gtggccgggtggactttgcctatmgttcmgcgttccaagcgcttctgt^ 

20 gcaccaaacgggtgggacttttccactcagaccggagcaagctgcagaaggcaggagctgcgacccacaaccgccgtcggccag 
camgccagtctgccaccacmccmggataccaagmgcagccmcaggcactgtgccccmcggm 
tmcacacagccaggaagactccagccgttgctcagatmctcaagctatgaggmcccttgtca 
ctacttccgccggcgacaaggccagcgggacctggagctccccgacatgcatatgcgggacctggtgggcatgggacaccac^ 
cctgccaagtgagccaccaagtgmtgtagmgacgtctacgmttcatccg 

25 mttccgtgcccaggaaatcgacgggcmgccctgctgctgctcmgg 

ggcccgccctgmgatctacgcccgcatcagcatgctcmggactcctagggctggtggcaccaggattc 
tcctcccgactgagcagagccagacagacattcctgaggggcccagaaatggcggcgttggag^ 
catagctggtgaggaggtctgggcacctcctccatggctctcaggggcctttc 
cacagmgatggggctttatgcttgtamtattgatagcactggcttcctccamgtcc 

30 cctctttctgtcccccattttccagggggtatatggtcagggctccccmc 
ctggatggaggcctagaaagcccttgccttccttcctcccacttcm 
cccccttcagcctgtttctgcagcagccagggttctcccccctacaccctctgcaggtggagagag 
cggtgcctgctggccmgacgccttmcgctgtgtgtatgactgtgtgactgtgtgggagcctggactga 
gctactctotggcatctccaggtgttttgtagcamcagccacttagtgctttc 

35 mtagccmgmtggcagcctcagcgcagaggcmggtcagamgagacgg^ 

cgcactgtgmgttcccctgaccgccctcctggttcacamgagcattmgamgctgcggtggtctgagcmca 
cgtggagcctcctggcctgcctgcccgcccaccctgggagtccagtggtgaggctcagagmcttctm 
ggagtttctgttgatgtgmgmggcagctcttggcctcccactcccacacttc 
gagctacctgaggaggaggcagggcagmgggcmgggcctgcctctgacctgccgtgtcctttgcaggm 

40 tttctgagcttattctattccccacccacacccccaggcagggttggamtc 
aaaataaatctgtaaaatctgaaaaaaaaaaaaaa 

317 

l^PFYTRMTMIGEIAAAVSFISKFLRTKGLTSERQLQTFSQSLQELLAEHYKHHWPEKPCKGSGYRCIRI 
45 QRIGLSSQELFRLLPSELTLVA/DPYEVSYRIGEDGSICVLYEASPAGGSTQNSTWQMVDSRISCKE 
VSG 
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cctctcggacktggamtgcagcta™ 

GTGACCAGGCCGCCATCGCTCGTCTCTTCCTCTCTCCTGCCGCCTC^ 

GACAAMGTAGTCGTCCGCCCCTCACGCCCTCTCTTCCTCTCAGCCTTCCGCCCGGTGAGGMGCCCG 
GTCGGGGCCGCGCCGCCGAGCCCCAGCGCCCCGGGCCGCCCCCGCACGCC^ 
5 CCATGATAGGCGAGATCGCCGCCGCCGTGTCCTTCATCTCCAAGTTT^ 

AGACCTTCAGCCAGAGCCTGCAGGAGCTGCTGGCAGMCATTATAMCATCACTGGTTCCCAGAAAAGCCATGCAAGGGATCGG 

GTTACCGTTGTATTCGCATCMCCATAAMTGGATC 

TGTTCAGGCTTCTCCCMGTGMCTCACACTCTGGGTTC^ 

GTGTGCTGTATGAAGCCTCACCAGCAGGAGGTAGCACTCAAMCAGCACCMCGTGCAAATGGTAGACAGCCGAATCAGCTGTA 
10 AGGAGGMCTTCTCTTGGGCAGAACGAGCCCTTCCAAAMCTACAATATGATG^ 

GGATCATCTGATGATGATCCATAAATTTGATTTTTGCTTTGGGTGGGCTC 

CACAGCTGTGMGATCTGGCACMGATAGAATGGTAAAAAAAAAAAAAM 

TTTCMTGATTMTTTTMTAGCCTGTGAGTCCMGTAMTGATCACTTTATTTG 

GTTTCTCCCAGACATACCTAMTTTTTACATCAATCCTTT^ 
15 TCGCAGGGGMTTTGCACTATTACACTTGAAAGTTGTTATTGTTAACOT 

AMCATGGTAGTACTGGAMTTTTACMGACTTTTACCTAGCACTTAMTATGTATAMTGTACATAMG 

ATGACCTGGGGAAATGGTCAGACCTTGTATTGTGTTTTT^ 

TTTAAAAMGACCCTTAAAMGACACTGTCTCMCTGTGGTGTO 

TTCTMGACTGAGTAMCTTCTTATTTTTAGAMGTGGAGGTC 
20 TTCCACAMCCACCATCTATTTTGTGMCTTTGTTAGTC^ 

CATGTATATTGATTGTGGCAMGTTGTACAGATTTCTATATTTTGG 

TATCTTGGCATTTTTAACC 

319 „ 

mtkmvmydfaaepgmeltvnegeiiti^ 
25 astaqasssmsnmqvgsgjtopwsawsasksgl^essegwgaqpegagaqrot 

wixspksssyfkdsesadaggaqrgnsrassssmkiplmfpgfmpgteqyllmqlmpk^ 
wadprkgskmyglksyieyqltptntnrsvnhrykhfd^^ 

rmcrhpvisesevfqqflnfrdekewktgkrkaerdelagvmifstmepeapdldlveieqkceavgkftk^ 
qehtorctgplpkeyqkigkalqslatvfsssgyqgetdlnm 
30 diigthkgaiekvkesdklvatskitlqdkqm 

JjSRFPVM 
320 

cgggagacgagccggccgtcccgggccgggggacccgcccgccatggccaccmggctcgggttatgtatgattto 

acctggamtaatgmctgacggttmtgmggagamtcatcacmtcacamtccggatgtaggtggaggatggctc 
35 mgamcatcaaaggagaacgagggctggttcccacagactacgttgam 

tggamttcagtggctgaccmgccttccttgattctctctcagccagcacagctcaggccagttcgtcggctgccagcmcaa 

tcaccaggttggcagtg&cmtgacccctggtcagcctggagtgcctc 

ggcccagccagaggg(^ctggagcccamgamcacamcactcccmcmctggga(^ctgccttcgg 

ccmggaccagcaactggtgatgatgatgactgggatgmgactc 
40 gtcagctgatgcaggcggcgctcagcgaggamcagtcgtgctagttcctcatccatgaamttc 

atttgcgamcctggcacggmcagtatttgttggccamcaactagcaamcccam 

ttatggcccmtgtgggtttatcctacot 

aaagagctacatcgmtatcagctmcacctactmcactm^ 

tgagcgtctcctggttaagtttgggtcagccattc 
45 atttatcaamtgcgcatggagagacttcaggcctc^atgaccaggatgtgtcgcc 

ccagcagttcctamtttccgagatgagmggmtc^aamctggamgaggm 

gatattttccaccatggmccagaggcacctc^ 

caaggccatggatgacggcgtgmggagctgctgacggtggggcaggagcactggmgcgctgcac 
atatcagmgataggamggccttgcagagtttggccacagtgttcagttcc^g 
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tgcmtmcagmgcaggamgacttatg^ 
ggmtgtmtcacgagtatamggttttcttggctgcttccctgacatca™ 
agamgtgacamctagttgcaacmgtaamtcaccctacaagacaaac^ 
cgcgttgcmgctgagatgmtcactttcacagtmccggatc^ 
5 agtgcaattttacgamcgattgcagaamgctgaggcaggccctcagccgctttcc 
agagmtgccgcgtgctttctcctgacttggggcmtgcmttcaaaactttttt^ 

AAAACCAAAMGAMGAGTTGCAAAAMCTGCATTTATTTTATTAGCCACCCTAMTC^ 
GTTCATTTCCGCATCCATTATTTAMCCAGTGGAMTTC 
GAAMTGAGGGGTTTCTCCCCACTGATATTTTACATAGAGTCATMTTT 
10 TMGTCTCCCACAATCTTCCAGTTCTTACCCAGTGTCAGATAATTMTTACTMTTACTTTCTTAAAM 
AATAAAAAATATCTATGTTTGTATAAAAAAAAAAAAAAAAAA 

321 

MKDLGAEHLAGHEOTQLLGLLWY^^ 

DRFLALMKVKPKHLSCICTCSFLLMRIVEEDCNIPSTHDVIRISQCKCTASDIKRMEKIISEKLHYELEATT 
15 ILCHTSERKEILSLDKLEAQLKACNCRLIFS 

AEYSSPECCKPDLKKLWIVSRRTAQNLHNSYYSVPELPTIPEGGCFDESESEDSCEDMSCGEESIjSSSPPSDQECTFFFNFKV 
AQTLCFPS 

322 

ccggtcggatgccggaccgggggcacccktgaggcggtgggtccccgacctc^ 
20 gtccgtgcggaccgcgaggccgcgggcgggtc&aggcgcgtc^ 

CATGAAGt^TCCAACTTCTC^ 

TTGATTGAGGCTACCCCGGAGMTGATAACACTTTGTGTCCAGGATTC 
MCTTTTTTGGATCTTGCACTGAAACTTTTGTCCTGGCTGTCM 
AMCATmTCTTGCATTGGAGTCTGTTCTTTTTTGCTGGCTGCTAGM 
25 GATGTGATCCGGATTAGTCAGTGTAAATGTACTGC 

GMTTGGMGCTACTACTGCCTTAMCTTTTTGCACTTATACCATACT^ 

CTGAGCCTTGATAAACTAGAAGCTCAGCTGAAAGCTTGCMCTGCCGACTCATCTTOT 

TTGTGCCTTCTCAATTTGGAAGTGGAMCTTTGAM 

ATTAATGACACTGAGTTCTTCTACTGGAGAGAGTTGGTTTCTAMTGCCTAGCCGAGTATTCTTCTC 
30 GATCTTMGMGTTGGTTTGGATCGTTTCMGGCGCACAGCCCA 
ACGATACCTGAGGGGGGTTGTTTTGATGAMGTC^ 

AGCTCTCCTCCCAGTGATCMGAGTGCACCTTCTT^ ' 
GATTGTTCTGTCAGAATTTATATTTACAGGGTTTC 

GTCAGGMTTMTATACTGGAATACCTACCTTCTATTTGTTATTCAGATCAGATCTG^ 
35 TCAMTGGGGTAGTGCCTCTTAMCCATTMCAGTACm 

GGGAGGAGGGMGGTGCTGATACCTTCMTTTGTTACTTTTCAAGATTTTTAAAM 

TATAAMCC3TCAGATGTCTTTMGCAGATTGGMGTATGC 

ATCATAGATTTCACCCTCCCCCCTTCTC^ 
' ATGATCAGTGTCTATTTGATGTGATGCAGATC™ 
40 ATGTCTTMTTGAGATTTGTGTTMGGC^^ 

MCTATGGTATTATTTTMTGTTACTTTAAAMTCCATMTCTGCTAGTTTTGCATGTA^ 

TGAAMCTCAGTATCTATGGMTTGATAMTGTTGATC 

TGCATGATTACATTTTATTTGCCTTTGT 

.323 

45 MPPHPLNKHTHTHTHTHTHTHTHKMLLG^ 

SCHSAKSKNPGKVYKMTLDPVKKQVSPELAMQ 
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aggcagtgtgtggcccagggamgctgcgtttcat 

gccattcctgcmgtgcccctgcttcagctctcaggcccatccmtgcctcgagcccacgggggtgtcccggcagg 
atcacgacccagtmcgctgctggctgtgtcctcatgcrc 

gagtgtgtacaggctgmtgttgagtgatgcccccccaccctctcaataaacacacacacacacacacacacacacacacacac 
5 aamtgttattaggcacagcagctccatcctgctccctgag 

agctggtctaccctgtccccgcacacttcacaggcactccactgcttggcaggmttctg 

cagggtgaaamtcmgctgtcactctgcaamtctaaaaaccctggamggtc 

cmgtgtcacctgagttggccatgcmtgaatgmtattataagcagtattggagm^ 

ttggamcattcgagttcatacagtgcatgac^ 
10 gatgagtacatgaatagtgatgmtagccctttgtgctc 

tgagttccattactggtgattgmgtattttcaggagcagatacatgtggtgcatc 

tcacccaaamgtgcagmcctctgamggtgagccgtggtgcaggtgcccccctcgctgttc 

tggcctgtcacacgccgggtctcaacmcatcgtgcgaggagacagtgactggtt^ 

catcaggmgcctgtactgtcctgatcccaccaggcttgggttmtttcctattgc™ 
15 tttgagagagctctcatgttaaamgaggttgtctttgcactgcgcttggtatgattrc 

ctagmtgtgcctttcgtgmggtctgacamtggggtctcagagtcctaggacccgcaccaaggtcagagtc 

gctgagcatttctacatgcacccagggtgctgggctgaggctmgggtggacgcgtagaccccatcacaggcttc 

cccaagtcccccatttggtcactatcccttctcagagcm 

tggcctggagtggcctttccctcagctgcctgtc 
20 gagtttctattcattgagacamcagtgagcgggcaccmcaggccagcagcttggcc^ 

tggagtctgggcagctgtctgcctctctacggctctgcttcccagcgggcacacgggtccggagcctc 

gac^cgcagggcacacctaggagmgctcaatgaccaccamgcccttgaggagagacggggt™ 

mcccccaggaccatgcacggctctggcagtgcacagtggctgattttcaccagttcctcctacccctgcctctgtcggagact 

agaamcattctaccccttcaggtcttcttttaamgaaaagagctatggcamgc^ 
25 gtggctttgcgttcttgaccatggccacaattctggtcacttctagcatctgaccto 

cgctggcctttgagagagtctcactctccagctagctctttgtgtagtgtccagggccccggcagtcaggatagag^ 

agmgatggacagtggacagggcgcagtggctcacgcctgtmtcccagcgctttgggagagcgaggcgggcagatcacaaggt 

cmgagatcmgaccattctggccmcatggtgamccctgtttctactaaamtacaaamttagctgggc 

gcctgtaatcccagctactcgggaggctgaggcaggagmtcgcttgmcccgggaggca 
30 gccactgcactccaoxxtgggtgacagagcmgactccgtctcaaaaaagatggccatc 
' atatctgtagtaggatgamggmtctggcctagggagccccggactcggtttacctaccccgtactcatcccactcatctgtc 

taccctagtgtccatcatgccctggatgmtcataggtgggagggacagmgccgagaacaccaggcacacggggtggggggaa 

tggggggagcc€aggtgatcccctmgamgtggattgatcctctcccctgtggmtgacagcttm 

gggcmctattttatgagcatttttamcagamggamgatgatatcccag 
35 ggcccctcgcctctccctctcccccagmccttggccagaattga 

agggctgmggtggaagtagggccactgccgcagcttcagccagagccctcgctgcttcgcagmtttmcmgcacacgactc 

ggattccttogatatgagamggmggcmgtgcgggcamgggmgctcctc 

tctttcccaggtttgcacaatcacttctggcamttmttgmggtccttgmttcagc 

. ttctgamctgtacttcttccctattctgtc^ 
40 ctgttcagtttgttaaaaaaaaaaaaaaaaaaa " ' * 

325 

MGTSLSPNDPWPLNPLS IQQTTLLLLLSVLATVHVGQRLLRQRRRQLRSAP PG PFAWPLIGNAAAVGQAAHLSFARLARRYGDV 
FQIRLGSCPIWMGERAIHQALVQQGSAFADRPAFASFRWSGGRSMFGHYS 
HVLSEARELVALLWGSADGAFLDPRPLTWAVAN^ 
45 PWTWREFEQLNRNFSNFILDKFLRHCESL^ 

TALQWLLLLFTRYPDVQTRVQAELDQWGRDRLPCMGDQPNLPYVLA^ 

FVNQWSV^DPVMPNPENFDPARFLDKDGLINKDLTSRVMIFSVGKRRCIGEEL^ 

FSYGLTIKPKSFKVNVTLRESMELLDSAVQNLQAKETCQ 
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ACTCTGGAGTGGGAGTGGGAGCGAGCGCTTCTGCGACTCCAGTTGTGAGAGCCGCM 
ACCCCCAGTCTCAATCTCMCGCTGTGAGGAMCCTCGACTTTGCCAGGTCCCCMGGGCAGCGGGGCT^ 
CCTTCTCCGTCCCCATCCCMTCCMGCGCTCCTGGCACTGACGACGCCMGAGACTCGAGTGGGAGTTAM 
GGCAGCAGGTGTCCAGGCCGGGCCTGCGGGTTCCTGTTGACGTC 
5 TGTCCCGCGCCACTGGAMCCGCACCTCCCCGCAGCATGGGCACCAGCCTCAG^ 

TCCATCCAGCAGACCACGCTCCTGCTACTCCTGTCGGTGCTGGCCACTGTGCATGTGGGC 
CGGCAGCTCCGGTCCGCGCCCCCGGGCCCGTTTGCGTGGCCACTGATCGGAMCGCGGCGGCGGTGGG 
TCGTTCGCTCGCCTGGCGCGGCGCTACGGCGACGTTTTCCAGATCCGCCTGGGCAGCTGCCCCAT^ 
CGCGCCATCCACCAGGCCCTGGTGCAGCAGGGCTCGGCCTTCGCCGACCGGCCGGCCTTC 

10 GGCCGCAGCATGGCTTTCGGCCACTACTCGGAGCAC 

ACGCGCCAGCCGCGCAGCCGCCAAGTCCTCGAGGGCCACGTGCTGAGCGAGGCGCGCGAGCTGGTGGCGCTGCTGGTGCGCGGC 
AGCGCGGACGGCGCCTTCCTCGACCCGAGGCCGCTGACCGTCGTGGCCGTGGCCMCGTCATGAGTGrc 
CGCTACAGCCACGACGACCCCGAGTTCCGTGAGCTGCTCAGCCACAACGAAGAGTTCGGGCGCACGGTGGGCGCGGGCAGCCTG 
GTGGACGTGATGCCCTGGCTGCAGTACTTCCCCMCCCGGTGCGCAC 

15 AGCMCTTCATCCTGGACMGTTCTTGAGGCACTGCGAAAGCCTTCGGCCCGGGGCCGCCCCCCGCGACATGA 
ATCCTCTCTGCGGAAMGMGGCGGCCGGGGACTCGCACGGTGGTGGCGC 
ACTGACATCTTCGGCGCCAGCCAGGACACCCTGTCCACCGCGCTGCAGTGGCTGCTCCTCCTCT^ 
CAGACTCGAGTGCAGGCAGMTTGGATCAGGTCGTGGGGAGGGACCGTCTGCCTTGTATGGGTGACCAGCCCMCCTGCCC 
GTCCTGGCCTTCCTCTATGMGCCATGCGC 

20 GTCTTGGGCTACCACATTCCCMGGACACTGTGGTTTTTGTCAACCAGTGG 

CCGGAGAACTTTGATCCAGCTCGATTCTTGGACMGGATGGCCTCATCMCMGGACCTGACCAGCAGAG 
GTGGGCAAAAGGCGGTGCATTGGCGMGMCTTTCTMGATGCAGCTTTTTCTCTTCATCTCCATCCTGGCTCACCAG 
TTCAGGGCCMCCCAAATGAGCCTGCGAAMTGAATTTCAGTTATGGTCTMCCATTA^ 
ACTCTCAGAGAGTCCATGGAGCTCCTTGATAGTGCTGTCCAAMTTTACAAGCCAAGGA 

25 MGCTGAMTTTTAGAMTATTCACATCTTCGGAGATGAGGAGTAAMTTCAGTTTTTTTCCAG 

tcmttagcgtttmggtgagcatamtcmctgtccatcaggtgaggtgtgctccatacccagcggttc 
gctatgcaggagcttctgggagatttttttgagtca 

tatttctgmggtagcattctttggagttaamtgcacatatagacacatacacccamcacttacaccamctac 
gaagtattttggtmccaggccatttttggtgggaatccaagat 
30 camgtttcagagtatattgttgmgagacagagacmgtmtto 
tamgaccagamttcccttttcaccttttcaggaamtmcttagactc 
tggtatacttccttacttttmggatamtcat^ 

ggttccatgagttatcatgmttttamgtatgcattattamttgtaaaactccaaggtgatgttgtacctc 
aaagtacagmttoaattatcagcamgaaaaaaaaaaaagccagccmgctttamttatgtc^ 
• • 35 agtmgtctcataggttaaaaaaaamgtcaccamtagtgtgaaatat^ • 

ATCTTGTTCAGGAAAAGGTTGMTMTATATCCCTTGTG 

AAAAATGAGCTTGATTAMTCMCCACCTATTTTTGACATGGAAATGMGCAGGGTO 
TCTCAAMTTAGATCCTAAGATGTGTTCTTATTTTTATMCATCTTTATTGAM 
AAATMCCTMTTAATATATTAAMTTCCAAATTCATGGCATGCTTAMTTTTM 
40 GTTCCAGTTGMGTTAGTGGAMTCTGMCATTCTCCTGTGGAAGGCAGAGAMTCTMGCTC 
AAMTGCCATGMTTACCTGGATGTTCTTTTTACGAGGTGACMGAG 
TCTTCTAGATGATTTTTTGAMGTTMCATTMTGCCTGCTTTTTC 
GAAMGACAGTGGAGATGAGGTCAGTTGTGTTTTTTMGATGGCMTTACTTTGGTAG 

ATGTATGCATTCACATTTAGAAMGTGMTTGMGTTTGMGTTO ' 
45 AGTGTCCTMGTGCTAAGTGCTTATTACATTW 

ATAGTTGTGTGTATGTGTGTGTGGGGTGGGGGGATGGTMGAGAAMGAGAGAMCACTGAAM 

TTTTCCCACTCATTCTGMTTMTTMTTTGGAGC^ 

AGTTAMTCTCAAATAGGCTATTAAAAMGTCTACMCATAGC^ 

ATTTTATTTTAAMTTTCATAGCTGTACTTCTTG 
50 AMGCAGGCTTGCCCAGTACATTTAMTTTTTTGGCACTTGCC 

TTGGGCTGCTGTAGCCTATTTTTTTAGATOAGAMTGTGTAGCTC 
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TCMCCAGGTCCAGATGTGCTATAATCTGTTTTTACGTATGTAGGCCCAGTC 
GTTTATATGGMGAMGTAAGGTGCTTGGAGTTTAC 

ACAAAAMCGMTGAAMTMCTGMTTTGGAGGCTGGAGTMTCAGATTACTGCm 
CCGGAGAGAGMTGTATTTGCTGACMCCATTAMGTCAGAAGTTTTACTCCAGGOT 
5 ATGCTTCATTTGTATGTCAMGCTTTGACTCTAT^ 

GMTTTTCTAAMGCTTTCATGTCCCAGMCTTAGCCTTTACCTGTGMGTC 
TATATTAGATCAMTAGTTGCATAGCAGTATATGTTMTTTGTC 
CTTTAGTAGACATTTATMCTCMGGATACCTTCTTATTTMTCTm 
GTGCATAATAGCTACAGTGCATAGTTGTAGACAMGTACATTCTGGGGAMCM 
1 0 ACCAMTTAAAAAAAAATTGTATCTCATTACTTATACTGGGACACCATTACCAAMTMTAAAM 
AAAA 

327 

MTTTLVSATIFDLSBVLCKGNKMLNYSAPSAGGCLLDRKAVGTPAGGGPPRRHSWLPS 

RFRDRSFSEGGSRLLPTQKQPGGGQVNSSRYKTELCRPFEENGACKYGDKCQFAHGIHELRSLTRHPKYKTELCRTFHTIGFCP 
15 YGPRCHFIHNAEERRALAGARDLSADRPRLQHSFSFAGFPSAMTAMTGLLDSPTSITPPPILSADDLIXSSPTLPDGTNNPF^ 
FSSQELASLFAPSMGLPGGGSPTTFLFRPMSESPHMFDSPPSPQDSLSDQEGYLSSSSSSHSGSDSPTLDNSRRLPIFSRLSIS 
DD 

328 

cggagtcagamggcgaggggcgccgggmctggcgtgtgggactccagacaggagaggctgcgccttccccgcaccggga 
20 tcgcgacacaccagatcctcgcccctggctcgcgcgmcgcacaggatgaccaccaccctcgtgtctgccaccatc 
gagcgmgttttatgcaagggtmcmgatgctcmctatagtgctcccagtgcagggggttc 
gggcacccctgctggtgggggcttccctcggaggcactcagtcaccctgcccagctccmgttccaccagmccagct^ 
cagcctcaagggtgagccagcccccgctctgagctcgcgagacagccgcttccgagaccgctccttctcggaagggggcgagcg 
gctgctgcccacccagmgcagcccgggggcggccaggtcaactccagccgctacmgacggagctgtgccgcccctttc 
25 amcggtgcctgtmgtacggggacmgtgccagttcgcacacggcatccacgagctccgcagcctgacccgccaccccaagta 
cmgacggagctgtgccgcaccttccacaccatcggcttttgcc 

gcgccgtgccctggccggggcccgggacctctccgctgaccgtccccgcctccagcatagctttagctttgctg^ 
tgcggctgccaccgccgctgccaccgggctgctggacagccccacgtccatcaccccaccccctattctgagcgccgatgacct 
cctgggctcacctaccctgcccgatggcaccmtmcccttttgccttctccagcc^ggagc^ 
30 catggggctgcccgggggtggctccccgaccaccttcctcttccggcccatgtccgagtcccctcacatgm 

cagccctcaggattctctctcggaccaggagggctacctgagcagctccagcagcagccacagtggctcagactccccgacctt 
ggacmctcmgacgcctgcccatcttcagcagactttccatc 

tccagcccctaccctgcacccacatcccataccctcttctccctacccatcccattccccacaggccctacattm 

agctcmcccctttcccccagcacctcagmtgtgccctccct^ 
35 tgtcagtagcttcttctggcttgamccccctccct^ 

gtcagtagatgcctttttttttccggcttmgccttm 

ttagtgccttgtmtctmctttgtcactgtgactacattacctcttcagcgccagag^ 

tgcccatggcggggtggagacccggmccagcagccccctccactggcgacac^^ 

acacmttcctcctcccctcttcccggtggcacctctcca 
40 gagttgttgccagaccagggttttgggggaaacctgtcttgacattcaaaacc 

mtcttgcctgggtttgtgtaggtctgcaggmggm 

ttmttmtcttttttttaagtggggaggmggggmgc 

gttcatatgtgacatctttttaaaaaaaatm 

agttmtggtattcattccacatacmtatctgtgtaamcgatttcctgtagm 
45 ccgtac^tctatccttagagcactcacgccatgct^ 

AAMTTTTGCTTGTCACTGCACATCMTATAAAAMGCTTATTTM 
TTGTTMTTTTGTTCATATTTATCGG 
TAAMTTTTGTGACGTGTCTTTTTCCTTTTTTTCCAC 
TTTTTGGTTATTTTATTTTAAAAAMTGAAAAATTM 
50 CCCCTTTGTGTOGACTGCAMTTGAGTTTCTTTCTCTTTAGGCC 
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GACAGTACAGMGGAAMCAACTTTTTATGTATAGCTO 
GCCCATCTCMGACATTCCACTCACAGATTTGAGGTTCTGGATTCCAGGTCTGGAG 
GCACACACACATTMGATGMTGTMTOTTATTCCTCTTGCTGGTCACTACCGTCGCTTO 
TTTATTTAAMGAAAAAAAMCTTTTTGTMCGACTATTTGCAGTTTA 

5 329 

MATFPPATSAPQQPPGPEDEDSSLDESDLYSLM^ 
330 

CAGCCTTATMGCTGAGGGAGTGGAGAGGCCCGGGGCCAGGAMGCAGAGACAGACAAAGCGTTAGGAGAAGAAGAGAGGCAGG 
GAAGACAAGCCAGGCACGATGGCCACCTTCCCACCAGCAACCAGCGCCCCCCAGCAGCCCCCAGGCCCGGAGGACGAGGACTCC 

10 AGCCTGGATGAATCTGACCTCTATAGCCTGGCCCATTCCTACCTCGGAG^ 

GCTGCCAACACCAACCGCCCCAGCCCTGGCGGGCACGAGAGGAAACTGGTGACCAAGCTGCAGAATTCAGAGAGGAAGAAGCGA 
GGGGCACGGCGCTGAGACAGAGCTGGAGATGAGGCCAGACCATGGACACTACACCCAGCAATAGAGACGGGACTGCGGAGGAAG 
GAGGACCCAGGACAGGATCCAGGCCGGCTTGCCACACCCCCCACCCCTAGGACTTATTCCCGCTGACIX3AGTC 
ACCAGGAAAGCGCCTCCAAGCCTAGCAAAAGTGCMGATGGGGAGTGAGAGGCTGGGMTGGAGGGGCAGAGCCAGGMGATCC 

15 CCCAGAAAAGAAAGCTACAGAAGAAACTGGGGCTCCTCCAGGGTGGCAGCAACAATAAATAGACACGCACGGCAGCCACAGCTT 
GGGTGTGTGTTCATCCTTGTTAAAAAAAAAAAAAAAAAAAAAAAAAA 

331 

MEGISIYTSDNYTEEMGSGDYDSl^EPCFREENANFNKIFLPTIYSIIFLTGIVGNGLVILVMGY 
LLFVITLPFWAVDAVANWYFGNFLCKAVHVIYTWLYSSVLILAFISLDRYLAIVHATN 
20 IPDFIFANVSEADDRYICDRFYPNDLWWVFQFQHIMVGLIL^ 

LPYYIGISIDSFILLEIIKQGCEFENTWKWISITEALAFFHCCLNPILYAFLGAKFKTSAQHALTSVSRGSSLKILSKGKRGG 
HSSVSTESESSSFHSS 

332 

GTTTGTTGGCTGCGGCAGCAGGTAGCAAAGTGACGCCGAGGGCCTGAG 
25 TACCATGGAGGGGATCAGTATATACACTTCAGATM^ 

CTGTTTCCGTGMGAAAATGCTMTTTCMTAAMTCTTCCTGCCCACCATCTACTCCAT 

CAATGGATTGGTCATCCTGGTCATGGGTTACCAGAAGAM 

CGACCTCCTCTTTGTCATCACGCTTCCCTTCTGGGCAGTTGATGCCGTGGCA 

AGTCCATGTCATCTACACAGTCMCCTCTACAGCAGTGTCCTCATCCTGGCCTTCATCAGTCTGG^ 
30 CCACGCCACCMCAGTCAGAGGCCMGGMGCTGTTGGCTGAAMGGTGGTCTATGTTGGCGTCTGGATCCCTGCCCTC 

GACTATTCCCGACTTCATCTTTGCCAACGTCAGTGAGGCAGATGACAGATATATC 

GGTGGTTGTGTTCCAGTTTCAGCACATCATGGTTGGCCTTATCCTGCCTGGTATTGTCA 

CTCCMGCTGTCACACTCCMGGGCCACCAGMGCGCMGGCCCTCMGACCACAGTCATCCTCATCCTGGCTOT 

TTGGCTGCCTTACTACATTGGGATCAGCATCGACTCCTTCATCCTC 
35 TGTGCACMGTGGATTTCCATCACCGAGGCCCTAGCTTTCTTCCACTGTTC 

CAAATTTAAMCCTCTGCCCAGCACGCACTCACCTCTGTGAGCAGAGGGTCC 

TGGACATTCATCTGTTTCCACTGAGTCTGAGTCTTGAAGT'ITrCACTCC 

ATAMTMCTTTTTTTTMGTTACACATTTTTCAGATATAAM 

TTGTCTTGTGTTTCTTTAGTTTTTGTGMGTTTM 
40 GCAGGACCTGTGGCCMGTTCTTAGTTGCTGTATGTCTCGTGGTAGG 

GTGAATCACGTAAAGCTAGAAATGATCCCCAGCTGTTTATGCATAGATAATCTCTCCA 

TAAGACGTGATTTTGCTGTAGMGATGGCACTTATAACCAMGCCCAMGTC^ 

GTGGGTTGATTTCAGCACCTACAGTGTACAGTCTTGTATTMGTTGTTMTAAMGTACATGTTAM 

333 

45 MALKERIGWRYSLLFVGLLQLNIVWGA^ 

TNPELEPKADLQQVLVKTSPRPSKKKAPLLDFSILKEKSFICYALFGLLATL 
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AGCMTTCATGGCTCTGMGGAGCGCATTGGCTGGAGAT 
TCGGAGCACTGCTCAGACCCATCATCATCAGAGGACCAGCGTCACCAAA^ 

ATATGCTTGAAMTGAGAAMCACGMCCTCMTAGACTCCATTGACTCAGGAGTAGMCTMCTACCTCACCTAAA 

CTACTCACACTMCCCAGMCTGGAGCCGMGGCAGACCTGCAGCAGGTCCTGGTGAAGACCAGCCCCAGGCCMGCAAAM 
5 AAGCCCCGCTATTAGACTTCTCCATTTTGAMGAGAAMGTT^ 

TCTTTGCACCTTCCTTGTACATCATTCCTCTGGGCATTAGTCTGGGCATTGACCAGGAC 

TGGCCATTGCAGMGTTTTCAGGAGGATCGGAGCTC^TTTO 

TCTGCGTCATCTTATTGACTGTGTCTCTGTTTGCCTTTACTTTTGC 

TTCGGTTTATGGTTGGMCMCAGGAGGGACCCACATTCCACTGCTTGCTGAGGATC 
10 CTGCAGCTGGGGTCTACATCTTCATTCAGAGCATAGCAGGACTGGCTGGACCACCCCTO 

GATCTACAGCAGGGCCTTCTACTCCTGCGCAGCTGGCATGGCCCTGGC 

GGGACTGTGCCAGCATCATCACTCAGGTGAAACAAAGGTAGTGAGCCATC^ 

TCTCGAMTGGATCTTGCAAAAAATGAGCACAGAGTTCACGTGCAAATG^ 

ACTGTGTTGGCTGGAGAGGGATGGGGTGGGCCCMCGGAG 
1 5 ACTACATTTTAMGGGMTGTGTATGTGMGAGCACTACC 

GCCAAMCAAAAMCMCCMGCACTCTTCCATATAT^ 

CTTTGGTTCACATTCCMTATTAAMTAGTGACACGMCTGGCAMGTGGTTTT 

TTTCTTTCTTTTCTTTCTTCGTATGGTCTTGTCAGMTA 

TGAAATTGGCCAGTC 

20 335 

MTWSVPQREPLVLGGRLAPLGFSSRGYFGALPMVTW 
VAGGTPRMPKKRRKKKVRASPAGQLPSRFHQYQQHRPSLEGGRSPATG 

PMPGQTPLRKEVLKSKMGKSEKIALPHGQLVHGIHLYEQPKINRQKSKYNLPLTKITSMRNENNFWQDSVSSDRIQKQEKKP 
FKNTENIKNSHLKKSAFLTEVSQKENYAGAKFSDPPSPSVLPKPPSH^^ 

25 336 

TGTTCCGCGATCTTCTCAQGCTCTCCTAGCAGCATCCATCGCCGCCACCCTATCTTCACTGGC 

CGTTGCTGAGCGACMGCTTCCTAGCGCTATGACTGTCGTCTCCGTCCCGCAG^ 

CGCCGCTTGGCTTTTCQTCCCGAGGTTACTTTGGGGCCCTCCCGATGGTGACCACGGCTCCGCCTC 

ACCCCCGGGCACTGCCCCCGACCCTCTTCCTCCCTCATTTCCTAGGGGGAGATGGCCCGTGTCTG 
30 CAGCAGCTCTGCCCMCCGCAGCCTCGCCGTGGCGGGAGGCACTCCTCGGGCAGCGCCGAAGMGCGGCGAAAGAAGAAGGTGC 

GGGCCAGCCCCGCAGGGCAGCTGCCCAGCCGCTTCCACCAGTACCAGCAGCACCGGCCGAGTCTGGAGGGCGGCCGGAGCCCCG 

CGACCGGCCCGAGCGGAGCGCAGGAGGTCCCGGGCCCGGCCGCCGCCTTGGGCCCGAGTCCTGCAGCCGCAGCCGGCACGGAGG 

GAGCCAGCCCCGACCTTGCCCCGCTGCGGCCCGCGGCTCCCGGCCAAACCCCCCTCAGGAAAGAGGTTTTA 

GAAMTCGGAGAAMTTGCCCTTCCCCATGGCCAGCTTGTTCATC^ 
35 AMGCAMTATAACTTGCCACTAACCAAGATCACCTCTGCAAAMGAMTGAAMCMCTTTTGGCAGGATTC 

ACAGAATTCAGMG(^GGAAAAAMGCCTTTTAAAMTACCGAGM 

CTGMGTGAGCCAAMGGAAAATTATGCTGGGGCAMGTTTAGTGATCCACCTTCTCCTAGTC 

ACTGGATGGGMGCACTGTTGAAMTTCCAACCAAMCAGGGAGCTGATGGCAGTACACTTAAAM 

CTTAGATTTCAGATTTCAGTATGTGTGTAAAACATM 
40 MTTTGCCTTGTTGCMCATACMTTGCAAMGATGAGTTTAAAAAA 

TAMTACTGTATACCATGTATTATGTGTATATTGTTCATACT^ 

CCCTGCCCACATTGCAGGTGTTTTGTATATATA^ 

TTGCACMGGTACTGAGATTTTTTTCMGAMCAGCTGT^ 

CTACTGMGTTTAMTCTGTGGCACMTCMTGTMGCATGGGGTTTG 
45 CAACTCCTATTCCCATTTTTGCTAMCTCMTTTCTGGTTTTGGTATATAT 

AACAAAATTGGTTGTMTCAAATTTTAAAATM^ 

GTTTCTCATGTTTGMTGTGTGACTAGGAGATGA 

CCGCCATGCCTGGCCATMTCTACATTTTCTTAC(^ 
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TGAGACGGGAGCATCTCTTGAGCCTGAGAAG 
GACGCTGTCTCAAAAAAAAGGCCAAGAGAAAGTAAGGGAGACAGA 

337 

MAEVEDQAAWDMKRLEEKDKERKNVKGIRDDIEEEDIXJEAYFRYMAENPTAGWQEEEED 
5 DHSEIDYPPFEKNPYNEHEEITNLTPQQLIDLRHKLNLRVSGAAPPRPGSSFAHFGFDEQLMHQIRKSEYTQPTPIQCQGVPVA 
LSGRDMIGIAKTGSGKTAAFIWPMLIHIMDQKEL^ 
ALQEGAEIWCTPGRLIDHWKKATNLQRVSYLWDEAD^ 
PIRWQGDIGEANEDVTQIVEILHSGPSKW^ 

VI SDFKKKDI PVLVATDVAARGLDI PSIKWIMDVARDIDTHTHRIGRTGRAGEKGVAYTLLTPKDSNFAGDLVRNLEGANQH 
10 VSKELLDLMQNAWFRKSRFKGGKGKKLNIGGGGIjGYRERPGLGSENMDRGNNNW 

kshfvaaslsnqkagssaagasgwtsagslnsvptns aqqghns pds pvtsaakgi pgfgntgni sgapvtypsagaqgvnnta 

sgnnsregtggsngkrerytenrgssrhshgetgnrhsdsprhgi^^ 

ganix;rngesrkeafnreskmepkmepkvdssk^ 

338 

15 GTCCATCTTGCTGCTGTCCACTTTGGGTTCCATCTTGGGCTGCATCTTC 

ATTCCGACCATCATTTGCACCCCGGTTCTCCCCATGCCGGCCTGCGCTTCCTCC 

AGTATGACGGCTGCTCCTTTCTGGATGGCGGTATCCATCTCCAT 

ATTGCCAGTCTCTCCGTGACTGTGACGGCTGCTGCCCCGGTTCTCAGTATATCTCrc 

TTCTCGGCTGTTATTCCCTGMGCTGTGTTGTTGACTCCTT^^ 
20 GCCAGTATTGCCAAAGCCTGGGATGCCCTTGGCGGCACTGGTGACGGGGCTC 

AGTTGGMCAGMTTCMGCTCCCTGCACTAGTCCACCTGCCCCACTGCTTCTTCGCTTCTCTC 

TCGGCTTGCMTGGACCCCMCTGCTCCTGCGCCGCTGGTGTCTCCTGCACCTGCGCTGGTTCCTGCMG 

ATGCACCTCCTGCAAGMGAGCTGCTGCTCCTGCTGCCCCGTGGGCTGTAGCMGTC 

GTCAGAGMGTGCAGCTGCTGCGACTGATGCCAGGACAACCTTTCTCCCAGATGTAMCAGAGAGACATGTAC^ 
25 TTTTTTTTATACCACCTTGACCCATTTGCTACA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

339 

mtqtlkyasrvfhrvrwapelgaslgyreyhsarrslad^ 
30 tvwmpalveellrqegprpercsfspwtehrrcrqracglltaegeewq^ 

lrrqrgrgtg ppalvrdvagefykfglegiaavllg srlgcleaqvppdtetf iravgsvfvstlltmamphwlrhlvpgpwgr 
lcrdtoqmfafaqrhverreaeaamrnggqpe^ 

pevqtalihseitaalspgssaypsawlsqlpllkawkevlrlypwpgnsrvpdkdihvgdyiipkl^ 
qfpepnsfrparwlgegptphpfaslpfgfgkrscmgrrlaelelqmalaqi^ 

35 FLDR 

340 

GGGGTTGAGATATGATGCTCAGGAGMGCGCTTTCTTTCGC 

TCCAGAGTGnCCATCGCGTCCGCTGGGCGCCCGAGTTGGGCGCCTCCCTAGGCTACCGAGAGTACCACTC 
TTGGCAGACATCCCAGGCCCCTCTACGCCCAGCTTTCTGGCCGMCTTTTCTGCM 

40 CAGGTGCAGGGCGCCGCGCACTTCGGGCCGGTGTCGCTAGCCAGCTTTGGGACAGTGCGCACCGTGTACGTC^ 

CTCGTCGAGGAGCTGCTGCGACAGGAGGGACCCCGGCCCGAGCGCTGCAGCTTCTCGCCCTGGACGGAGCACCGCCGCTGCC^ 
CAGCGGGCTTGCGGACTGCTCACTGCGGMGGCGMGMTGGCAMGGCTCCGCAGTCTCCTGGCCCCGC 
CMGCGGCCGCCCGCTACGCCGGMCCCTGMCMCGTAGTCTGCGACCTTGTGCGGCGTCTGAGGCGCCAG 
ACGGGGCCGCCCGCCCTGGTTCGGGACGTGGCGGGGGMTTTTACMGTTCGGACTGGAAGGC^ 

45 TCGCGCTTGGGCTGCCTGGAGGCTCMGTGCCACCCGACACGGAGACCTTC^ 
- CTGTTGACCATGGCGATGCCCCACTGGCTGCGCCAC 
TTTGCATTTGCTCAGAGGCACGTGGAGCGGCGAGAGGCAGAGGCAGCCATGAGGMCGGAGGACAGCCCGAGMGGA 
TCTGGGGCGCACCTGACCCACTTCCTGTTCCGGGAAGAGTTGCCTGCCCAGTCCATCC 
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GCGGGAGTGGACACGGTGTCCMCACGCTCIK^TTGGGCT^ 
TCAGAGATCACAGCTGCCCTGAGCCCTGGCTC 

GTGGTCMGGMGTGCTMGACTGTACCCTGTGGTACCTGGAMTTCTC^ 

ATTATCCCCAAAMTACGCTGGTCACTCTGTGTCACTATGCCACT^ 
5 CGTCCAGCTCGCTGGCTCX^5AGGGTCCCACCCCCCACCCATTOCAOT 

GGGAGACGCCTGGCAGAGCTTGMTTGCAMTGGCTTTGGCCCAGATC 

GCCCCAGTTAGACCCAAGACCCGGACTGTCCTGGTACCTGAMG^^ 

AAGAGACTGTCATCATCACCCTTTCATTCATCATAGGGATMGATTTTTTGTA^ 

ATGCCTATCTGACCAAACTGGATAGMCCACCATAGTGMGTGTGAGGCGG 
10 GACTCAGGMGCCAG&TGAGAAMCCATGGTCTCTCTC 

CAGATTTTMCTAATAATGCTGGATGG 

GCTGGCTMGCATTTATCAAAGCATMGCTCAGTMCT 

CTCTTTGAGAGGMGGGTGMGCCTTATTTGTTTTTTATG 

TCTTAGATTGAATCTGMCCATGTGGCAGAAGGGATAAGCAGCTTACTTAGTA 
15 GCCCCTAGGMGGTGAATCTGCCCTAGCCTGGTTTACGGTTTCT^ 

TGCCCCATCACmGTTCTCAGGCAGAGACATCTTTGGGCCTGTCCCTGCC^ 

AATATTCACAMTTTTAGAATAMTCAMTATTCCATTCTT 

341 

MR^EGREIPSLGGARRREVLQAGRSQRMGRRRRRQELELGVGSGRPGGPPPGPGRRGTCAMLPPEW 
20 PLAMAKWLl^YFSLGNSKTKSPPQPPRPDYREQRRRGERPSQPPQAVPQASSAASASCGPATASCFSASSGSLPDDSGSTSDLI 
RAYRAQKERHFQDPYNGPGSSLRKLRAMCRLDYCGGSGEPGGVQRAFSASSASGAAGCCCASSGAGAAASSSSSSGSPHLYRSS 
SERRPATPAEWYISPKHRLIKVESMGGGAGDPLGGACAGGRTW^ 

FDAKNDLKSKAGKGESAGYMEPYEAQRIMTEFQRQESVRSQHKGIQLYDTPYEPEGQSVDSDSESTVSPRLRESKLPQDDDRPA 
DEYDQPWE^RVTSPALAAQFNGNEKRQSSPSPSRDRRRQLRAPGGGFKPIKHGSPEFCGILGERVDPAVPLEKQIWY 
25 GDAENLLRLCKECSYLVRNSQTSKHDYPLSLRSNQGFMHMKLAKTKEKWLGQNSPPFDSVPEVIHYYTTRKLPIKGAEHLSLL 
YPVAVRTL 

342 

CGGGCCGCCGGGACGGGCACGGGCGCGCGGGCTCCGGCGGGCGCCGGCTGCCTTCCTCCGTCGCTCGCTGTCTCTCCCGGCC 
ATTCTCCTCCGCTGCGGGGCCGAGCTCTCCCCAGCGCTCGCAGGMGGMGMGGGAGCCGAGGACGCCGAGAAG 

30 CAGCCGCGGATCCCGGCCAAGGCGGAGGCTGCGGCTCCGACGGGGCAGGAGCGCGATCCACGGCGAGGGGCGTACGGCCAAAGG 
GTCCGCGGCGTGGAGCGCTCGGACCTTCCGCTCTCCCCCGGGCGTGGGCCGGGACCCCATGAGACGCGCCCACGAGGGGCGCG 
GATTCCTAGCTTGGGCGGCGCTAGGCGGAGGGAGGTGTTGCAGGCCGGCCGGAGCCAGAGAGCTGCCGGCAGGAGGCGGCGGCG 
GCMGAACTTGMCTTGGCGTCGGGAGCGGGCGCCCCGGAGGCCCCCCGCCGGGGCCGGGGCGCCGAGGGACCTGCGCCGCAGC 
• GCTGCCCCCCGMTGGCCGCGGCGGCGGACCGGGCTC^ 

35 CAAGTACTTCAGCTTGGGCAACAGCAAGACCAAGAGCCCCCCGCAGCCGCCGCGGCCAGACTACCGCGAGCAGCGGCGCCGAGG 
CGAGCGGCCTTCGCAGCCCCCCCAGGCCGTGCCGCAGGCCTCCTCCGCCGCCTCGGCGTCCTGCG 

CTTCTCAGCCTCTTCGGGCTCGCTGCCCGACGACAGCGGCAGCACCAGCGACCTCATCCGCGCCTACCGCGCGCAGAAGGAGCG 
ACACTTCCAGGACCCCTACAACGGGCCTCGCTCGTCGCTGC 

CGGGGAGCCAGGCGGGGTCCAGCGCGCCTTCTCGGCCTCGTCCGCGTCGGGCGCCGCGGGCTGTTC 
40 GGGCGCCGCCGCGTCCTCGTCCTCGTCCTCC^ 
CGAGGTGCGCTACATCTCCCCCMGCAC^ 

CGCCTGCGCGGGCGGCCGCACCTGGAGCCCGACGGCCTGCGGAGGCMGAMCTGCTGAACMGTGCGC 
GGAGAGCC^GCCGGCMGMGGACAAGGTGACCATAGCCGATGACTACTCAGATCCCTTTGATGCCMGMTGA 
CAMGCAGGAMGGGGGAGAGTGCTGGCTACATGGAGCCCTATGAGGCAC^ 
45 TGTCCGGTCCCAGCATAAAGGTATCCAGTTAT^^ 
CACAGTCAGCCCCCGACTGCGGGAGAGCMGCTC^^ 

GMCCGGGTCACCAGCCCAGCCCTGGCAGCACAGTTTAATGGCMCGAGAAGCGGCAGTCATC^ 
GCGGCGCCAGCTTCGTGCCCCTGGAGGGGGCTTTMGCCTATCAAACATGGGAGCCC^ 
GGTGGATCCTGCCGTCCCCCTGGAGAAGCAMTATGGTATCACGGAGCCATCAGCAGAGGAGACGCCGAGM 
50 CTGCAAGGAGTGTAGCTACCTTGTCCGGMCAGCCAGACCAGCM 



WO 02/46465 



382 



PCT/GB01/05458 



TATGCACATGAAACTGGCCAAMCCAAAGAGAAATACGTO 
CCACTACTACACCACCAGAMGCTACCCATCAMGGGGCTG 

AGCGGACCAGACCTGCCCTGCTCTGTGACAGAGCCTGGAGACTTGGAGGTGCCAGAGGCCCCCCACCAACCAGCCCCCAGCCAC 
TGTTGCTGGCTGTGTCGTTTGTGTTGTGTGTATGGTACT 
5 GCTGGAGGCCTGGATAAAGACAGAAGGGCGGCAACACC 

343 

MASVLSRRLGKRSLLGARVLGPSASEGPSMPPSEPLLEGMPQPFTTSD^ 
YGGQECTGLVEQHSMEGQVTVVfljLEQKLQVCCR^ 
MAAMVLTSLSCSWQSPPGTEANFSASm^ 
10 HGFETDPDPFLLDEPAPRKRKNSVKVMYKCLW 

AAAMGTPVPGTPTSEPAPTPSMTGLPLSALPPPLHKAQSSGPEHPGPESSLPSGALSKSAPGSFWHIQADHAYQALPSFQIPV 

SPHIYTSVSWAMPSMCSLSPVRSRSLSFSEPQQPAPAMSHLIVTSPPRAQSGARKARGEAKKCRKVYGIEH 

KKACQRFLD 

344 

15 MGTGCGCATGTGCGCGAGGAGTCGCTCGGGCACTTATTGAGCGC^ 

TCAGTGTCCCCTTCGCCTCCGCAGGAGAGGAGAGGCAGCAGCATGGCGAGTGTCCTGTC 

CTGGGAGCCCGGGTGTTGGGACCCAGTGCCTCGGAC^GGCCCTCGGCTGCCCCACCCTC 
. CCCCAGCCTTTCACCACCTCTGATGACACCCCCTGCCAGGAGCAGCCCAAGGAAGTCCTTAAGGCTCCCAGCACCTCGGGCCTT 

CAGCAGGTGGCCTTTCAGCCTGGGCAGMGGTTTATGTGTGGTACGG 
20 TGGATGGAGGGTCAGGTGACCGTCTGGCTGCTGGAGCAG^ 

CTGCAGGGCCCCTGTCCCCAGGCACCACCCCTGGAGCCCGGAGCCCAGGCCCTGGCCTACAGGCCCGTCTCCAGGMCA 

gtcccamgaggmgtcggacgcagtggamtggatgagatgatggcggccatggtgctgacgtccctgtc 
gtacagagtcctcccgggaccgaggccmcttctctgcttcrc 

gacagcggcagcagcactaccagcggtcactggagtgggagcagtggtgtctccaccccctcgcccccccacccccaggccagc 
25 cccmgtatttgggggatgcttttggttctc^ 

ccagctccacgaaamgamgmctctgtgmggtgatgtacmgtgcctgtggccamctgtggcamgtt 

gtgggcatcamcgacacgtcamgccctccatctgggggacacagtc^ 

tacacagaggtgcagctgmggaggmtctgctgctgctgctgctgctgctgccgcaggcaccccagtc 

tccgagccagctcccacccccagcatgactggcctgcctctgtctgctcttccaccacctctg 
30 ccagmcatcctggcccggagtcctccctgccctcaggggctctcagcmgtcagctcctgggtccttctggcaca 
» gatcatgcataccaggctctgccatccttcca^ 

tccgccgcctgctctctctctccggtccgga^ 

catcagatcgtcacttctccaccccgggcccagagtggtgccag^ 

ggcatcgagcaccgggaccagtggtgcacggcctgccggtggmgm 
35 gttctactctgttcctggccctgccggcagccactgacmgaggccagtgtgtcaccagccctcagcagaaaccgaaagagaaa 

gaacggamcacggagtttgggctctgtt^^ 

ttaaaaaamgamcaaaaatatttttcccot 

actamgaattmttaccctccgtto^ 

ttctgcttcctgctgagtcggtgmttctttgctttct 

40 345 

MAMHFI FSiyTAVLLFDFWSVH SPAGMALSVLVLLL I SQQTIAETDGDSAGSDSFPVGR 

THHRWYLCHFGQSLIHVIQWIGYFIMLAVMSYNTWIFLGVVLGSAVGYYLAYPLIjSTA 

346 

TCGGCACAGGAGCGAGGAGACCCGAGAGCAGACGCGCCCTGGCGCCCGCCCTGC^ 
45 TCAGATACAGCGGTGCTTCTGTTTGATITC 

CTGGCTGTACTGTATCMGGCATCMGGTTGGCAMGCCMGCTGCTCMCCAGGTACTGGTG 

CAGCAGACCATCGCAGAGACAGACGGGGACTCTGCAGGCTCAGATTCATTCCCTGT^ 

TGTCACTTTGGCCAGTCTCTMTCCATGTCATC^ 
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TGGATTTTCCTTGGTGTGGTCTTGGGCTCTGCTGTGGGCTACTACCTAGCTTACCCACTTCTCA 

MCGTGCAGGCACTGAGGCTGGAGGGACATGGAGCCCCCTCTTCCAGACACTATACTTCCMCTGCCCTTTC 

TTCCTCCACCTTATTCCCAGCCCCTGGAMCTTTGA(X:TGMGCCAGCACTTC 

CTTCCTTCTCAGCCAGCCTACGTAGGGCCCAGGCATGGTCTTGTGTCTTMGACAGCTGCTC 
5 MCAGGGGTGGCAGGGTTACTGAGCCCATGACMTGCTTCTCTGTGACTCAMCCAGGM 

GMGGGTTCTTGGTGATGCAGGGCATGGMCCTGGACACCCTCAGCTCTCCTGC 

TTC^ACTTCCTGACCTCTTCTGTCTTTGAGGGACAGAGACCMGCTAGATCCT 

AAGATCATCTCGTCTATGGATCATGTTGACAMCTAAGTTTTT 

TTCATACAAAAAAATTTTTMTGAMTGATTTC 
10 GGGAGAGMTMGTGACAGCTGATTAMGGCAGAGACACAGGACTGCTTTC 

CCTTCCACCAGMGGCACTGCCTGCAGGMGMGATGATCTGATGGCCGTGGGTGTCTGGGMGCTCTTC 

CTCCTTTATCCTCATCTTTCTTCTATGCAGMC 

TACTGCTTACTCGTAATGATCTAGTGGGGAMCATGATTCATTC 

AGATGCMTCCAACCAMGCCATTACATTTTTTGAGTTAGATGGGACT 
15 GAAAGAGAGAAMTCACTGCTGTATACTAAATACCTCACAGATTAGATC 

ATACATTTTAGTAAATAT 

347 

MSRFVQDLSKAMSQIX5ASQFQEVIRQELELSVKKELEKILTTASSHEFEHTKKDLDGFRKLFHRFLQEKGPSVDWGKIQRPPED 
' S IQPYEKI KARGLPDN I S SVLNKLVVWLNGGLGTSMGCKGPKSLIGVRN 
20 EDTKKILQKYNHCRVKIYTFNQSRYPRINKES^^^ 

DNLGATVDLYILNHLINPPNGKRCEFVMEVTmTRADVKGGTLTQYEGKLR 
AAVKRLQEQNAIDMEIIWAKTLDGGLW^^ 

KREFPTVPLVKI/3SSFTKVQDYLRRFESIPDMLELDHLTVSGDVTFGKNVSLKGTVI I IANHGDRIDI PPGAVLENKIVSGNLR 
ILDH ... 

25 348 

AGGAGAGGMGAGAGACCTGCCCTGTAGCGTGACTCCTCTAGAAAAAAAAAAAAAMGCCGGAGTATTTO 

ATGTCGAGATTTGTACMGATCTTAGCAMGCMTGTCTCMGATGGTGCTTCTCAGTTCCAAGAAGTCATTCGGCAAGAGCTA 

GAATTATCTGTGMGMGGMCTAGAAAAAATACTCACCACAGCATCATCACATGM 

GGATTTCGGMGCTATTTCATAGATTTTTGCAAGAAMGG^ 
30 TCGATTCMCCCTATGAAMGATAMGGCCAGGGGCCTGCCTGATMTATATCTTCCGTGTTGAACAMCTAG 

CTCMTGGTGGTTTGGGAACCAGCATGGGCTGCAMGGCCCTAA^ 

CTGACTGTTCAGCAMTTGMCATTTGMCAAMCCTACMTACAGATGTCCCTCTTG 

GMGATACCAAAAAMTACTACAGMGTACMTCATTGTCGTC 

MTAMGMTCTTTACGGCCTGTAGCAMGGACGOT 
35 GATATTTACGCCAGTTTCTACMCTCTGGAT^ 

GATMTCTGGGTGCCACAGTGGATCTGTATATTCTTMTCATCTMTC 

GMGTCACAMTAAAACACGTGCAGATGTAMGGGCGGGACACTCACTCM^ 

CAAGTGCCAAMGCACATGTTGACGAGTTCMGTCTGTATCAM^ 

GCAGCAGTTAAAAGACTGCAGGAGCAAMTGCCATTGACATGGAMTCATTGTGMTGCAMG 
40 GTCATTCMTTAGAAACTGCAGTAGGGGCTGCCATCAAMG 

CTGCCTGTCAAMCCACATCAGATCTCTTGCTGGTGATGTCAMCCTCTATAGTCTTMTGCAGGATC 
MGCGGGMTTTCCTACAGTGCCCTTGGTT^ 

ATACCAGATATGCTTGMTTGGATCACCTCACAGTTTCAGGAGATGTGACATTT^ 
ATCATCATTGCAAATCATGGTGACAGMTTGATATCCCACCTGGA 
45 ATCTTGGACCACTCAMTGAAAMTACTGTGGACACTT^ 

GCACTAAMGGTACTTTACTATGTTACTGTACCCTGCAGTGTTGATTTTTAAM . 
AGAAMGCACAGATGGTGCAATACTTTCOTCTTTGMGAGATCCCAMG 
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MKLWSALLMAWFGVLSCVQAEFFTSIGHMTD 
NAYKLVKRLOTDWPALEDLVLQDSMGFIMLSV 

CFGMGRSAYNEGDYYHmWMEQVLKQLDAGEEATTTKSQVLDYLSYAWQLGDLHRALEL^ 
LLEEEREKTLTNQTEAELATPEGIYERPVDYLPERDVYESLCR 
5 HIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVL^ 

YGVGGQYEPHFDFSRTOERDTFKHLGTGI^VATFLNYMSDVEAGGATVFPDIjGMIW 
PVLVGCKWVSNKWFHERGQEFLRPCGSTEVD 

350 

GGGGMGGMCACTGTAGGGGATAGCTGTCCACGGACGCTGTCTACMGACCCTGGAGTGAGATMCGTC^ 
10 CTGCATGTGTAAGATGCCCAGTTGACCTTCGCAGCAGGAGCCTGGATCAGGCACTTCCTGCC 
GACACTTCCCTCTGTGACCATGAMCTCTGGGTGTCTGCATTGCTGATGG 
ATTCTTCACCTCTATTGGGCACATGACTGACCTGAmA 

GGAGGMGCCMGCTTTCCMGATTMGAGCTGGGCCMCAAAATGGMGCCTTGACTAGCMG 
CTACCTGGCTCACCCTGTGMTGCCTACAMCTGGTGMGCGGCTAMCACAGACTGGCCTGCGC 

15 GGACTCAGCTGCAGGTTTTATCGCCMCCTCTCTGTGCAGCGGCAGTTCTTCCCCACTGATGAGGAC 
AGCCCTGATGAGACTTCAGGACACATACAGGCTGGACCCAGGCACMTTTCCAGAGGGGMCTTCCA^ 
MTGCTGAGTGTGGATGACTGCTTTGGGATGGGCCGCTCGGCCTACMTGMGGGGACTATTATC 
GCAGGTGCTAMGCAGCTTGATGCCGGGGAGGAGGCCACCACMCCMGTCACAGGTGCTGGACTACCTCAGCTATGCTGTC 
CCAGTTGGGTGATCTGCACCGTGCCCTGGAGCTCACCCGCCGCCTGCTCTCCCTTGACCCAAGCCACGAACGAGCTGGAGGGAA 

20 TCTGCGGTACTTTGAGCAGmTTGGAGGMGAGAGAGAAAAMCGTTMCAAATCAGACAG 
AGGCATCTATGAGAGGCCTGTGGACTACCTGCCTGAGAGGGATC^ 

ACCCCGTAGACAGAAGAGGCTTTTCTGTAGGTACCACCATGGCMCAGGGCCCCACAGCTGCTCATTGCCCCCTTCAM 

GGACGAGTGGGACAGCCCGCACATCGTCAGGTACTACGATGTCATGTCTGATGAGGAAATCGAGAGGATCAAGGAGATCGCAAA 

ACCTAMCTTGCACGAGCCACCGTTCGTGATCCCMGACAGGAGTCCTCACTGTCGCCAGCTACCGGGTTTCCAAMG 
25 GCTAGAGGAAGATGATGACCCTGTTGTGGCCCGAGTAAATCGTCGGATGCAGCATATCACAGGGTTAACAGTAAAGACTGCAGA 

ATTGTTACAGGTTGCAMTTATGGAGTGGGAGGACAGTATGMCCGCACTTCGACTTCTCTAGG 

CMGCATTTAGGGACGGGGMTCGTGTGGCTACTTTCTTAMCTACATGAGTGATGTAGM 

TGATCTGGG^CTGCMTTTGGCCTMGMGGGTACAGCTGTGTTCTGGTACMCCTC™ 

MCMGACATGCTGCCTGCCCTGTGCTTGTGGGCTGCMGTGGGTC 
30 GAGACCTTGTGGATCMCAGMGTTGACTGACATCCTTTTCTGTCCTTCCCCTTCCTGGTCCTTCAGCCCATGTC 

GACACCTTTGTATGTTCCTTGTATGTTCCTATCAGGCTGATTTTTGGAG 

CTAGGGCGACTCCTGTGTGACTGMGTCCCAGCCCTTCCATTCAGCCTGTGC^ 

CTGCAGCAGAGTTAGCTGTCTAGCGCCTAGCMGGTGCCTTTGTACCTCAGGTGTTTTAGGTG 

AGTTCTGATACCTTGTTTACATGTTTGTTTTTATGGCATTTCTATC™ 
35 AAGCCTTAAA 

351 

MPAHLLQDDISSSYTTTOTITAPPPGVLQNGGDKLETMPLYLEDDIRPDIKDDIYDPTO 

GALYGITLIPTCKFYTWLWGVFYYFVSALGITAGAHRLWSHRSYKARLPLRLFLI IANTMAFQNDVYEWARDHRAHHKFSETHA " 
DPHNSRRGFFFSHVGWLLVRKHPAVKEKGSTLDLSDLEAEKLWFQRRYYKPGLLMMCFILPTLVPWYFW 
40 RYAWLNATWLVNSAAHLFGYRPYDKNI S PRENI LVSLGAVGEGFHNYHHSFPYDYS ASEYRWH INFNTFFIDWMAALGLTYDR 
KKVSKAAILARIKRTGDGNYKSG 

352 

GACGGTCACCCGTTGCCAGCTCTAGCCTTTAMTTCCCGGCTCGGGGACCTC 
GCGTGCMGGCGCCGCGGCTCAGCGCGTACCGGCGGGTTTCGAMCCGCAGTCCTCC^ 
45 TCAGCCCCCTGGAMGTGATCCCGGCATCGGAGAGCCMGATGCCGGCCCACTTGC 

CACCACCACCACCATTACAGCGCCTCCTCCAGGGGTCCTGCAGAATGGAGGAGATMGTTGGAGACGATGCCCCTCT^ 

AGACGACATTCGCGCTGATATAAAAGATGATATATATGACCCCACCTACAAGGATMGGMGGCCCM^ 

TGTCTC^AGAAACATCATCCTTATGTCTCTGCTACACTTGGGA^ 
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cacctggctttggggggtattctactattttgtcagtgccctgggca^ 
ttacamgctcggctgcccctacggctctto 
tgaccaccgtgcccaccacmgttttcagamcacatgctgatcctcatmttc 
ttggctgcttgtgcgcaaacacccagctgtc^ 
5 gatgttccagaggaggtactacamcctggcttgctgatgatgtg^ 
tgamcttttcaamcagtgtgttcgttgccactttc 
tgcccacctcttcggatatcgtcot 
gggcttccacmctaccaccactcctttccctatgactactctgccagtc 

CATTGATTGGATGGCCGCCCTCGGTCTGACCTATGACCGGAAGAAAGTCTCCMGGC 
10 CGGAGATGGAMCTACMGAGTGGCTGAGTTTGGGGTCCCTCAGGTTCCTTTTTC 
CTGTTTATTMCTACTGMTAATGCTACCAGGATGCTAMGATGATGATC 
CAAAAGTATTGAAAGCCAAAAAAAAAAAAAAAAAAAAAAAAA 

MEPRDGSPEARSSDSESASASSSGSERDAGPEPDKAPRRLNKRRFPGLRLFGHRKAITKSGLQHLAPPPPTPGAPCSESERQIR 
15 STVDWSESATYGEHIWFETOTSGDFCYVGEQYCVAR^ 

FVRHHVJVHRRRQDGKCRHCGKGFQQKFTFHSKEIVAISCSWCKQAYHSKVSCFMLQQIEEPCSLGVHMWI 
NTLKASKKKKRASFKRKSSKKGPEEGRTOPFIIRPTPSPLMKPLLV 

LEMYRKVHNLRI LACGGDGTVGWI LSTLDQLRLKPPPPVAI LPLGTGNDLARTLNWGGG YTDEPVSKI LSHVEEGNWQLDRWD 
LHAEPNPEAGPEDRDEGATDRLPLDVFNNYFSLGFDAHVTLEFHESRE^ 
20 RWCDG^LTPKIQDLKPQCWFLNIPRYCAGTMPWGHPGEHHDFEPQRHDDGYLEVIGFTMTSL^ 
LTTSKAIPVQVI&EPCKLAASRIRIALRNQA 

LiGTVWPGDSDLELCRAHIERLQQEPDGAGMSPTCQKLSPKWCFLDATTASRFYRIDRAQEHLNYW 
ASARPDLPTPTSPLPTSPCSPTPRSLQGDAAPPQGEELIEAMRNDFCKIiQELHRAGGDLMHRDE 
YLLDHAPPEILDAVEENGETCLHQAAALGQRTICHYIVEM^ 
25 ETAV 

354 

GCGGCGCGGAGCGGGCGTGCTGAGCCCCGGCCGCCGGCCCGGCATGGGCGTCTCCCGCGGGCCCTCCGCCGGCCGGGGCTAGGG 
CCGGATGGAGCCGCGGGACGGTAGCC.CCGAGGCCCGGAGCAGCGACTCCGAGTCGGCTTCCGCCTCGTCCAGCGGCTCCGAGCG 
CGACGCCGGTCCCGAGCCGGACMGGCGCCGCGGCGACTCMCMGCGGCGCTTCCCGGGGCTGCGGCTC 
30 AGCCATCACCAAGTCGGGCCTCCAGCACCTGGCCCCCCCTCCGCCCACCCCTGGGGCCCCGTGCAGCGAGTCAGAGCGGCAGAT 
CCGGAGTACAGTGGACTGGAGCGAGTCAGCGACATATGGGGAGCACATCTGGTTCGAGACCMCGTGTC 
CGTTGGGGAGCAGTACTGTGTAGCCAGGATGCTGMGTCAGTGTC^ 

GCCCTGCATCGAGCAGCTGGAGMGATAMTTTCCGCTGTMGCCGTCCTTCCGTGMTCAGGCTCCAGGM 

MCCTTTGTACGGCACCACTGGGTACACAGACGACGCCAGGACGGCAAGTGTCGGCA 
3 5 CACCTTCCACAGCAAGGAGATTGTGGCC ATCAGCTGCTCGTGGTGC AAGCAGGC ATACCACAGCAAGGTGTCCTGCTTCATGCT 

GCAGCAGATCGAGGAGCCGTGCTCGCTGGGGGTCCACGCAGCCGTGGTCATCCCGCCCACCTGGATCCTCCGC 

CCAGMTACTCTGAMGCMGCMGMGMGMGAGGGCATCCTTCAAGAGGAAGTCCAGCAAGAAAGGGCCTGAGGAGGGCCG 

CTGGAGACCCTTCATCATCAGGCCCACCCCCTCCCCGCTCATGAAGCCCCTGCTGGTGTTTGTC 

CCAGGGTGCAMGATCATCCAGTCTTTCCTCTGGTATCTCMTCCCCGACMGTC 
40 GGCGCTGGAGATGTACCGCAMGTGCACMCCTGCGGATCCTGGCGTGCGGGGGCGACGGCACG^ 

CCTGGACCAGCTACGCCTGAAGCCGCCACCCCCTGTTGCCATCCTGCCCCTGGGT^^ 

CTGGGGTGGGGGCTACACAGATGAGCCTGTGTCCMGATCCTCTCCCACGTGGAGGAGGGG 

GGACCTCCACGCTGAGCCCMCCCCGAGGCAGGGCCTGAGGACCGAGATGMGGCGCCACCGACCGGTTGCCCCTC 

CMCMCTACTTCAGCCTGGGCTTTGACGCCCACGTCACCCTGGAGTTC^ 
45 CAGCCGCTTTCGGMTMGATGTTCTACGCCGGGACAGCTTTCTCTGACTTCC 

CATCCGAGTGGTGTGTGATGGMTGGACTTGACTCCCMGATCCAGGACCTGAAACC 

CAGGTACTGTGCGGGCACCATGCCCTGGGGCCACCCTGGGGAGCACCACGACT 

CGAGGTCATTGGCTTCACCATGACGTCGTTGGCCGCGCTGCAGGTGGGCGGACACGGCGAGC 

GGTGCTCACCACATCCMGGCCATCCCGGTGCAGGTGGATGGCGAGCCCTGCAAGCTTC 
50 (XXX:AACCAGGCCACCATGGTGCAGMGC^CAAGCGGCGGAGCGCCGCCCCCCTGCACAGCGACCAGCA(^ 
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gttgcgcatccaggtgagtcgcgtcagcatgcacgact 
gccgctgggcactgtggtggtcccaggagacagtgacctagag 

tggtgctggagccaagtccccgacatgccagamctgtcccccaagtggtgcttcctggacgccaccactgccagc 
caggatcgaccgagcccaggagcacctcmctatgtgactgac^tcgcacaggatgagatt 
5 gggggcatcggcccggcctgacctcccmcccccacttcccctctccccacctcaccctc^ 
aggggatgctgcaccccctcmggtgmgagctgattgaggctgccmgaggmcga 
agctgggggcgacctcatgcaccgagacgagcagagtcgcacgctcctgcaccacgcagtcagcact^ 
ccgctacctgctggaccacgcccccccagagatccttgatgcggtggaggaamcggggagac 
cctgggccagcgcaccatctgccactacatcgtggaggccgggcx:ctcgctcatgmgacagaccagcagggcgacactc 

10 gcagcgggctgagmggctcaggacaccgagctggccgcctacctgg 

ccaggagacggctgtgtagcgggccckccacgggcagcaggagggacmtgcggccaggggacgagcg 
cactgccacattccagtgggacggccacggggggacctaggccccagggamgagccccatgccgccccctm 
gacctagggctggactcaggagctgggggggcctcacctgttcccctgaggaccccgccggacccggaggctcacag^ 
gacacggctgggttggatatgcctttgccggggttctggg^ 

15 ggggctggagagggggaggccttcgggmgaggcttcctgggccc 

caccccacctcctccgggcttcctcccggamctcagcgcctgctgcacttgcct^ 

gaccctccccgctcccctgtcatttcatcgcggactgtgcc&cctgggot 

gctgggtgtgactcagtamgtggatttttttttcttt 

355 

20 MQIPRAALLPLLLLLLMPASAQLSRAGRSAPLAAGCPD 

EGLQCWPFGVPASATWRRAQAGLCVCASSEPVCGSDMTYANLCQLRMS 

YNFIADVVEKIAPAVVHIELFRKLPFSKREVPVASGSGFIVSEIX5LIVTNAHVVTNKHRW 

LIKIDHQGKLFVLLLGRSSELRTC^ 

GEVIGINTLKWAGISFAIPSDKIKKFLTESHDRQAKGKAITKKKYIGIRMMSLTSSKAKELKDRHRDFPDVISGAYIIEV 
25 TPAEAGGLKENDVI ISINGQSWSANDVSDVIKRESTLNMVVRRGNEDIMITVI PEEIDP 

356 

CCGGCCCTCGCCCTGTCCGCCGCCACCGCCGCCGCCGCCAGAGTCGCCATGCAGATCCCGCGCGCCGCTCTTCTCCCGCTGCTG 

CTGCTGCTGCTGGCGGCGCCCGCCTCGGCGCAGCTGTCCCGGGCCGGCCGCTCGGCGCCTTTGGCCG^ 

TGCGAGCCGC^GCGCTGCCCGCCGCAGCCGGAGCACTGCGAGGGCGGCCGGGCCCGGGACGCGTGCGGCTC^ 

30 GGCGCGCCCGAGGGCGCCGCGTGCGGCCTGCAGGAGGGCCCGT^ 

GCCTCGGCCACGGTGCGGCGGCGCGCGCAGGCCGGCCTCTGTGTGTGCGCCAGCAGCGAGCCGGTGTGCGGCAGCGACGCCAAC 
ACCTACGCCAACCTGTGCCAGCTGCGCGCCGCCAGCCGCCGCTCCGAGAGGCTGCACCGGCCGCCGGTCATCGTCCTGCAGCGC 
GGAGCCTGCGGCCMGGGCAGGMGATCCCMCAGTTTGCGCCATAMTATMCTTTATC 
CCTGCCGTGGTTCATATCGMTTGTTTCGCMGCTTCCGTTTTCTAMCGAGAGGTGCCGGTGGCTAGTGGG 

35 GTGTCGGMGATGGACTGATCGTGACAMTGCCCACGTGGTGACCMCMGC 

ACTTACGMGCCAAMTCMGGATGTGGATGAGAAAGCAGACATCGCACTCATCAAMTTGACCACCAGGGCAAGCTC 
CTGCTGCTTGGCCGCTCCTCAC^GCTGCGGCCGGGAGAGTTCGTGGTCGCCATCGGM 

ACCACCGGGATCGTGAGCACCACCCAGCGAGGCGGCAMGAGCTGGGGCTCCGCMCTCAGACATGGACTACATC 
. GCCATCATCMCTATGGAMCTCGGGAGGCCCGTTAGTAMCCTGGACGGTC 

40 GCTGGMTCTCCTTTGCMTCGCATCTGATMGATTAAAMGTTCCTCACGGAGTCCCATGACCGACA^ 

ATCACCMGAAGAAGTATATTGGTATCCGAATGATGTCACTCACGTCCAGCAAAGCCAAAGAGCTGAAGGACCGGCACCGGGAC 
TTCCCAGACGTGATCTCAGGAGCGTATATMTTGMGTMTTCCTGATACCCCAGCAGMGCTGGTC 
GTCATMTCAGCATCMTGGACAGTCCGTGGTCTCCGCCMTGATGTCAGCGACGTCATTAAAAG 
GTGGTCCGCAGGGGTMTGMGATATCATGATCACAGTGATTCCCGMGAMTTGACCCATAGGCAGAGGC 

45 CATGTTTCCCTCAAAGACTCTCCCGTGGATGACGGATGAGGACTCTGGGCTO 
ATTTTGTTTGTTCAGTGGAGACTCCCTGGCC 
CGCAGGCAGMGCTCTGCCCTTCTGTATCCTATGTATGCAGTC^ 
CAGCATTTGTCTCCTCCTTTMCTGAGTCATCATCTTAGTCCMCTMTGCAGTCG 
CGGGAGCCAGGATGGGACTGGTCGTGTTTGTGCTTTTCTCCMGTCAGCACCCAM 
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GAGCGCTGGCTTCTCAMCGGCCGMGTTGCCTC 
ATTAA AGTAC TTCTTAC AAA 

357 

MALETICRPSGRKSSKMQAFRIVTOVNQKTFYTjRNNQ 
5 LQLEAVNITDLSENRKQDKRFAFIRSDSGPTTSFESMCPGWFLCTAMEADQPVSLTNMPDEGVMVTK^ 

358 

AGCTCCACCCTGGGAGGGACTGTGGCCCAGGTACTGCCCGGGTGCTACTTT^ 

AGACCTCAGAAGACCTCCTGTCCTATGAGGCCCTCCCCATGGCmAGAGACGATCTGCCGACCCTCT^ 
AGATGCMGCCTTCAGMTCTGGGATGTTMCCAGMGACCTTCTATC 
10 GACCAAATGTCAATmGAAGAAAAGATAGATGTGGTACCCATTGAGTC 
TGTGCCTGTCCTGTGTCAAGTCTGGTGATC^^ 

AGCAGGACAAGCGCTTCGCCTTCATCCGCTCAGACAGTGGCCCCACCACCAGTTTTGAGTC 

TCTGCACAGCGATGGMGCTGACCAGCCCGTCAGCCTCACCMTATC^ 

AGGAGGACGAGTAG 

15 359 

MI PNGYLMFEDENF IES S VAKLNALRKSGQFCDVRLQVCGHEMLAHRAVLACC S PYLFEI FNSDSDPHG I SHVKFDDLNPEAVE 
VLLNYAYTAQLKADKELVKDWSAAKELKMD^ 

EFLKLPRLKLEVMLEDWCLPSNGKLYTKVINWVQRSIVffiNGDSLEELMEEVQTLYYSADHKLLDGNL 
FVQKKPPRENGHKQISSSSTGCLSSPNAWQSPKHEWKIVASEKTSNNTYLCLAVLDGIFCVIFLHGRN 
20 LSFEMQQDELIEKPMSPMQYARSGLGTAEMNGKLIMGGY^ 

GGSNGHSDDLSCGEMYDSNIDDWIPVPELRTNRCNAGVCALNGKLYIVGGSDPYGQKGLKNCDWDPVTKLOTSCAPLNIRR 

SAVCELGGYLYIIGGAESWNCLNTVERYNPEI^TWTLIAPMNVARRGAGV 

MGHMTS PRSNAGIATVGNTI YAVEDSMAMNF • 

360 

25 agtgtctcccggtcgcgggtggaggtcggtcgctcagagc 

cggcgagcgagcgagttcccgcgagttctcggtggcgctcccccttcctttcagtctccacggactggcccctcgtc 

maccgctcccgtcttctgccgccttctggcgctttccgttc 

ctagaaatgaatgtttccatctcttcagagatgaaccag 

ttaaggactoattacatcattttcm^ 
30 cacctgtcttcataccctgacgmgtgcaccttttm^ 

ccaccaacagagtagccamtttatmggaaamtgattcgcaatggatatttg 

tctgttgccaaattamtgccctgaggaamgtggccagttctg 

cacagagcagtgctagcttc£tgcagtccctatttatttgam 

amtttgatgatctcmtccagaagctgttgmgtcttgttgmttatgcctac 
35 ' gtaaaagatgtttattctgcagcaaaagagctgmgato 

gatgttaccagctgcatctcttaccgamttttgcmgttgtatgggagactcccatttgttgaatm 

caggagcatttgttacamtttctgmgaggaggagtto 

gtttgcttgcccagcmtggcamttatatacaaaggtmtcaact 

gmgagctgatggaagaggttcamccttgtactactcagctgatcacaagctgcttc 
40 gaggtgtttggcagtgatgatgaccacattcagtttgtgcagaaamgccaccacgtga 

agttcaactggatgtctctcttctccamtgctacagtacaaagccctaagcatc 

tcamtmcacttacttgtgcctggctgtgctggatggtatattc^ 

tcaccmcmgtactccaaaactmgtaagagm 

atgcagtac(k:acgatctggtctgggmcagcagagatgmtggcamctcatagctgcagg 
45 cttcgaacagtcgmtgctatmtccacatacagatcactggtcctttcttgctcc 
atggctgtactcatgggccagctctatgtggtaggtggatcamtggccactcagatgacc 
tcamcatagatgactggattcctgttccagmttgagmctmccx5ttgtmtc 
tacatcgttc&tggctctgatc^ 
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AGCTGTGCCCCTCTTMCATTCGGAGACACC^ 

TCTTGGMTTGTCTGMCACAGTAGMCGATACMTCCTGAAMTAATACCTGGAC 
CGAGGAGCTGGAGTGGCTGTTCTTMTGGAAMCTGTTTGTATGTC^ 

ATGTATGATCCMCTAGAMTGMTGGMGATGATGGGACATATGACTTCACCAAGGAGCMTGCTG 
5 MCACCATTTATGCAGTGGAGGATTCGATGGCMTGMTTTCTGMTAC 
GCCCCTATACAMGATTTTCCAGTTTTMCGGGTTTMGACCCTCTCAMC 
AGGTACACTTGTGAATAAAGAGGGTGGGTGGGGATAGATGTTGCTAACAGCM^ 
ACATGCTGTACATACTTTTTGGGTTTATTO 

TTTACTTTTTGGAAMGATAMCCMGMTTGATTGGGCACATCAAAAAA 
10 361 

MVIX5VMILPVLIMIALPSPSMEDEKPKVNPKLYMCVCEGLSCGNEDHCEGQQCFSSLSINDGFHWQ 
PSPGQAVECCQGDWCNRNITAQLPTKGKSFPGTQNFHLEVGL 1 1 LS WFAVCLLACLLGVALRKFKRRNQERLNPRDVEYGTI E 
GLITTWGDSTLADLLDHSCTSGSGSGLPFLVQRTVARQITLLECVGKGRYGEVWRGSWQGEWAVKIFSSRDEK 
NTVMLRHENILGFIASDMTSRHSSTQLWLITHYHEMGSLYDYLQLTTLDTVSCL^ , 
15 LKSKNILVKKNGQCCIADIX5LAVMHSQ 

NGIVEDYKPPFYDWPNDPSFEDMRKWCVDQQRPNIPMWFSDPTLTSLMLffiECWYQNPSARLTALRIKKTLTKIDNS 
LKTDC 

362 

GMGCGMTAGCGTTTTCAGAGATATTGGGCGGCTCMGGGTC 
20 GCCCACTGCAGCCTCCACCTCCCAGGCTCMGCAGTCCTTCCCCCCT 

TGGTAAGCGTCACACTGCCAMGTGAGAGCTGCTGGAGMCTCATMTCCCAGGMCGCCTC 

TGACCAGAGTGAGAGMGCTCTGAACGAGGGCACGCGGCTTGAAGGACTGTGGGCAGATGTGACCM 

TACMTGGTAGATGGAGTGATGATTCTTCCTGTGCTTATCATGATTGCTCTCCCCTCCCCTAGTATGGM 

GGTCMCCCCAMCTCTACATGTGTGTGTGTGMGGTCTCTCCTGCGGTMTGAGGACC 
25 CTCACTGAGCATCMCGATGGCTTCCACGTCTACCAGAMGGCTGCTTCCAGGTTTATGAGCAGG^ 

CCCGCCGTCCCCTGGCCAAGCTGTGGAGTGCTGCCAAGGGGACTGGTGTAACAGGMCATCACGGCCCAGCTGCCCACTAAAGG 

AAMTCCTTCCCTGGMCACAGMTTTCCACTTGGAGGTTG^ 

CTGCCTGCTGGGAGTTGCTCTCCGAAAATTTAAMGGCGCAACCMGAACG 

CGMGGGCTCATCACCACCMTGTTGGAGACAGCACTTTAGCAGATTTATTGGATCATTCGTGTACATC 
30 TCTTCCTTTTCOKM^ 

GAGGGGCAGCTGGCMGGGGAAMTGTTGCCGTGMGATCTTCTC^ 

GTACAACACTGTGATGCTGAGGCATGAAMTATCTTAGGTTTCATTGOTTC 

GTGGTTMTTACACATTATCATGAAATGGGATCGTTGTACGACTATCTTCAGCTTACTACT 

MTAGTGCTGTCCATAGCTAGTGGTCTTGCACATTTG^ 
3 5 AGATTTAMGAGCAAAMTATTCTGGTTMGMGMTGGACAGTC 

GAGCACCMTCAGCTTGATGTGGGGMCMTCCCCGTGTGGGCACCAAGCGCTACATGGCCCCCGM 

CCAGGTGGATTGTTTCGATTCmTAAMGGGTCGATATTOGGCCTTTGGA 

GAGCMTGGTATAGTGGAGGATTACMGCCACCGTTCTACGATC 

AGTCTGTGTGGATCMCAMGGCCAMCATACCCMCAGATGGTTCT 
40 AGMTCCTGGTATCAAMTCCATCCGCAAGACTCACAGCACTGCGTATCAAAAAGACTTTGA 

CAMTTGAAMCTGACTGTTGACATTTTCATAGTGTCMGMGGM 

GCTGGCCTGACTGGTTGTCAGMTGGMTCCATCTGT^^ 

CATGTGTTGGGGAGACATCAAAACCACCCTMCCTCGCTCGATGACTGTGMCTGGGCATTTC^ 

GACTMTGTTGGACAGACACTGTTGCAMGGTAGGGACTGGAGG 
45 GTGGCTTTGCATAG£TTTCACMGTCTCCTAGACACTCCCCACG^ 

GCCTGTGCTTCTCTTCTTTATTGCACTAGGMTTCTTTGCATTC 

GCCAAAATGTTGGCTGCGTACTCCACTGGTCTGTCTTTGGATMTAG^ 

TTGCTGCATTTTACACATGTGCTGATGTTTACAATGATC 

TACTTGTGCACTTAGTAGTTTTTACAAMC^^ 
50 GAMTGTTTTTAACACTATACTCTAAMTGGACATTTTCTTTTATTATCA 
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TGTGTGTAGACTGTAACTTTTTTTC^ 

ATTTAGAAGCAMGATTTCAGTAGMTTTTAGTCCTGMCGCTACGGGGAAM 

ATTTAMCTCTGCCAGAAAAAMTMCTATTTTC 

TGTTTTCAAGTC 

5 363 

WQVPLPLDRDGTLVRLRFTMm 

WWAITFPVFGFFFCIIWSLWHFEYTVATDCGWl^LPSVSSAIGGEVPQRYVMFCIGLHSAPRFLVAFAYWNHYL 
CYRPLCRLNFGLNVVmALL^ 

I SFFSALAVYFRHNMYC EAGVYTI FAI LEYT WLTNMAFHMTAWWDFGNKELL I TSQPEEKRF 
10 364 

GGCCTOCCMTCGCGAAMCCCGCCGTTCGCGCTCTGACCAGCCCGCAGAG 
GGGTTCCGCCGGCACTCTCGCCACCAtt 

TGGTACGGCTCCGCTTCACCATGGTGGCCCTGGTCACGGTCTGCTGTCCACTTGTC 
TGCTCTTCCACTTCMGGAGACAACGGCCACACACTGTGGGGCCACGC 
15 ACCCCGATGGGACCTTGTTCCGGCTTCGCTTCACAGCCATGGTCTC 
GCATCATCTGGTCCCTGGTGTTCCACTTTGAG 

CAGCCATCGGCGGGGAGGTGCCCCAGCGCTACGTGTGGCGTTTCTGCATCGGCCTC 
TCGCCTACTGGMCCACTACCTCAGCTGCACCTCCCCGTGTTCCTGCTATCGCCCGOT 

TCGTGGAGMCCTCGCGTTGCTAGTGCTCACTTATGTCTCCTCCTCCGAGGACTTCACCATCCACGAAMTGCTTTC 

20 TCATTGCCTCATCCCTCGGGCACATGCTCCTCACCTGCATTCTCTGGCGGTTGA 

GCMGTCCTACAGCTGGAAACAGCGGCTCTTCATCATCMCTTCATCTCCTTCTTCTCGGCGCTGGCTGTCTACTT 
ACATGTATTGTGAGGCTGGAGTGTACACCATCTTTGCCATCCTGGAGTACACTGTTC 
CGGCCTGGTGGGACTTCGGGAACAAGGAGCTGCTCATMCCTCTCAGCCTGAGGAAMGCGATTCTGM 
TGGGAGGACGCAGCCCACTGCCCAGAMCMGAAACACGATACCATTCTGGCCTTCCCCACCCC^ 

25 TGMGATGGGGGAAGGGTAAGMGGMGGGTGTAGGCCAAGGCTCACCCCAGTGCTGCTGGCTTCTCCTCTCCACCCCTCATAT 
GGGCGTGGGGTCCTCAMCATCACCTTTACCTGAGAGGCCCCM 

AAAAAAMCGTCCTGAGGTTCATGACCACCATCCAGTTTCTGGCCTTTACACAGTCACCTTTCACTGAGGTCAGGAGCCCCTG^ 
GCAGTGGCTGCTCCCTGACMCCACAGCCATTTCTC 

ACATCCAGGCCTGTGGCCACAGTCCCCTGCTAMGTTGCTCAGGTGTTCTAGTCCTGA 
30 CCTAGGGTATGTACCCTTCCCCATCTGAGCCTCGGTGTGTCCATGTGTCTC 

ACTCTACCAGTCAGTGGTTCTGATGTCATCGGGTGGAGGTGGT^ 

GCACAGCATGTATTTTCTTCTCTTCTGAMGTTCTGGCmTAGACCCCTCCCCTCCTTO 

AGATGCAGATCTCTACTGTAAMTGGGCTCCCTGGTATCTC 

TTTATTTTGAAAGGAAMTAMTTTTTTTTTTGG 
35 AAAAAAAAAAAAAAAAAAAAAAAA 

365 

MMLSLNNLQNI IYNPVI PFVGTI PDQLDPGTLIVIRGHVPSDADRFQVDLQNGSSMPRADVAFHFNPRFKRAGCIVCNTLINE 
KWGREEITYOTPFQKEKKSFEIVIMVLKMFQVAVNGKH^ 
TEI SRENVPKSGTPQLRLPFAARLNTPMGPGRTVW^ 
40 EERNITSFPFSPGMYFEMIIYCDTOEFKVAVNGVHSLEYKHRFKELSSIDTLEINGDIHLLEVRSW 

366 

ACACAGMGAGACTCCMTCGACMGMGCTGGAAMGMTGATGTTGTCCTTAMCMCC 
GTAATCCCGTTTGTTGGCACCATTCCTGATCAGCTGGATOT 
GACAGATTCCAGGTGGATCTGCAGMTGGCAGCAGCATGAMCCTCG 
45 AGGGCCGGCTGCATTGTTrGCMTACTT-TGATAMTGAAAMTGGGGACGGGM 
GAGAAAMGTCTTTTGAGATCGTGATTATGGTGCTGMGGCCAAATTCCAGG 
GGCCACAGGATCGGCCCAGAGAAMTAGACACTCTGGGCATTTAT 

TCGGACTTACAAAGTACCCAAGCATOTAGTCTGGMCTGACAGAGATMGTAGAGAAMTGTTCC 
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CTTAGGCTGCCATTCGCTGCMGGTTGMCACCC 
GCCAAMGCTTTAATGTTGACCTACTAGCAGGAAAATCAMGGATATTG 
TTTGTAAGAMTTCTTTTCTTCAGGAGTCCTGGGGAGMGMG 
TTTGAGATGATAATTTATTGTGATGTTAGAGMTTC 
5 AMGAGCTCAGCAGTATTGACACGCTGGAMTTMTGGAGACATCCACTTACTGGMGTAAGGAGCTGGTAGCCTACCTACACA 
GCTGCTACAAAAACCAAMTACAGMTGGCTTCTGTGATACTGGCCTTGCTGAMCGCATCTC 
TTGTTAAAATGACCT 

367 

MAALTRDPQFQKLQQWYREHRSELNLRRLFDAMDRFNHFSLTLNTNHGHILVDYSKNLW 
10 FNGEKINYTEGRAVLHVALRNRSNTPILVDGKD^ 

EALKPYSSGGPRVWYVSNIDGTHIMTLAQLNPESSLFIIASKTFTTQETITNAETAKEWFLQAAKDPS 
VKEFGIDPQNMFEFWDWVGGRYSLWSAIGLSIAL^^ 

MLP YDQYLHRFMYFQQGDMESNGKYITKSGTRVDHQTG PI VWGEPGTNGQHAF YQLI HQGTKM I PCDFLI PVQTQHPIRKGL- 
HHKILLANFLAQTEALMRGKSTEEARKELQAAG 
15 WDINSFDQWGVELGKQLAKKIEPEIJDGSAQVTSHDASTNGLINFIKQQREARVQ 

368 

CTCGAGAGCTCCGCCATGGCCGCTCTCACCCGGGACCCCCAGTTCCAGAAGCTGCAGCAATGGTACCGCGAGCACCGCTCCGAG 

CTGMCCTGCGCCGCCTCTTCGATGCCMCAAGGACCGCTTCMCCACTTCAGCTTC 

CTGGTGGATTACTCCAAGMCCTGGTGACGGAGGACGTGATC 
20 GCCCGGGAGCGGATGTTCMTGGTGAGMGATCMCTACACCGAGGGTCG 
• MCACACCCATCCTGGTAGACGGCAAGGATGTGATGCCAGAGGTCMCAA 

GTCCGGAGCGGTGACTGGMGGGGTACACAGGCMGACCATCACGGACGTCATCMCATTGGCATTGTCGGCTC 

CCCCTCATGGTGACTGAAGCCCTTAAGCCATACTCTTCAGGAGGTCCCCGCGTCTGGTATGTCTCCMCATO 

ATTGCCAAMCCCTGGCCCAGCTGAACCCGGAGTCCTCCCTGTTC^ 
25 ACGAATGCAGAGACGGCGMGGAGTGGTTTCTCCAGGCGGCCAAGGATCCTTC 

ACTAACACMCCAAAGTGMGGAGTTTGGAATTGACCCTCAAAA 

CTGTGGTCGGCCATCGGACTCTCCATTGCCCTGCACGTGGGTTTTGACMCTTCGAGCAGC 

GACCAGCACTTCCGCACGACGCCCCTGGAGMGAACGCCCCCGTCTTGCTGGCGCTGCTGGGTATCT^ 

GGGTGTGAGACACACGCCATGCTGCCCTATGACCAGTACCTGCACCGCT^ 
30 MTGGGAMTACATCACCAMTCTGGMCCCGTGTGGACCACCAGACAGGCCCCATTGTGTGGGGGGAGCC 

CAGCATGCTTTTTACCAGCTCATCCACCMGGCACCMGATGATACCCTGTGACTTCCTCA 

ATACGGMGGGTCTGCAT«CMGATCCTCCTGGCCMCTO 

GAGGCCCGAMGGAGCTCCAGGCTGCGGGCMGAGTCCAGAGGACCTTGAGAGGCTGCTGCCACATAAGGTCTTTC 
CGCCCAACCMCTCTATTGTGTTCACCAAGCTCACACCATTCATGCTTGGAGCCT^ 
35 GTTCAGGGCATCATCTGGGACATCMCAGCTTTGACCAGTGGGG 
GAGCTTGATGGCAGTGCTCMGTGACCTCTCACGACGCTTCTAC^ 
AGAGTCCMTAAACTCGTGCTCATCTGCAGCCTCCTCTGTGACTCCCCTTTCTCTTCTCGTC 

GCATGTTCCTGGACACCACCCAGAGCACCCTCTGGTTGTGGGCTTGGACCACGAGCCCTTAGCAGGGAAGGCTGGTCTCCCCCA 
GCCTAACCCCCAGCCCCTCCATGTCTATGCTCCCTCTGTGTTAGMTTGGCTGMGTGTTm 
40 CATGTTCACGTTGTTCACATCCCATGTAGAAAAACAAAGATGCCACGGAGGAGGT 

369 

MKKEHVLHCQFSAWYPFFRGVTIKSVILPLPQNVKDYLLDDGTLWSGRDDPPTHSQPDSDDEAEEIQWSDDEMTAT 
EFATKVQEPINSLGGSVFPKLNWSAPRDAYW^ 

WCELJPGAEFRCFWENKLIGISQRDYTQYYDHISKQKEEIRRCIQDFFKKHIQYKFLDEDFVFDIYRDS 
45 VTDSLLFTWEELISENNLNGDFSEVDAQEQDSPAFRCTNS 

370 

TGCGTTTAGGGCGAAGACGGAGTTGTAMCTTCTTAAMTTCCTCTCTCGACAC 
CTTTTTGTATGCGTGTGTGTCMGCGTATGCCCCGGGAT 
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cggaagagtccttgmccaaaatagctcggcgggcacto 
gamggcagcagcggcggcagctggaggatgmgmggagcatgtgcttcactgccagttctc 
aggcgttaccatcmgagtgtcattcttccacttcctcagaatgtgm 
aggmgggatgatccaccmcacattctcagccagacagtgatgatgmgcagaagam^^ 
5 agccacgcttacggcaccagaatttcctgagtttgccactamgtccag^ 

tmgcttmttggagtgccccmgggatgcgtattggatagcmtgmtagttctctg 
tctgcttttcmgagttccgatttcatc^ 
tatagaatatgagctcgttctccgaamtggtgtgmttgattcctggggc 
tattggtamctcamgagactacacacmtactatgatcatatttctamcaaaaggm 
10 ctttttcmgamcacatacagtacamttc™ 

gctcattgactttmtccatttggtgmgtcacagattcactgctgttc 

cggcgattttagtgmgttgacgctcmgagcaggattccccagctttccgttgcacamcag 

cccctatttgagttaccggctacccmggactttgtagacctctct 

gctgmgagamtcagcaggaggacgactgatgagcgtactgtmctggagmgaggaggccccgccccaccgctccgggagct 
15 * gctcatcagccgcmcttcctgccgaccctgatgcgggtgggccgagcagtgtggacatcagccacttt^ 
ttcacctggggaaaaaaacggagggactttgctacttc^ 
gttctgataataaaatgctttctatgaaat 

371 

MERIPSAQPPPACLPKAPGLEHGDLPGMYPAHMQWKSRRGIKRSEDSKETYKLPHRLIEKKRRDRINECIAQLKDLLPEHLK 
20 LTTLGHLEKAWLELTLKHVKAbTNLIDQQQQKIIALQSGLQAGELSGRNVETGQEMFGS 
SQLVTHLHRWSELLQGGTSRKPSDPAPKVMDFKEKPSSPA 

DLRSEQPCFKSDHGRRFTMGERIGAIKQESEEPPTKKNRMQLSDDEGHFTSSDLISSPFLGPHPHQPPFCLPFYLIPPSATAYL 
PMLEKCWYPTSVPVLYPGLNASAMLSSFMNPDKISAPLLMP 

372 

25 GGACACCGGGCCATGCACGCCCCCMCTGMGCTGCATCTCAMGCCGMGATTCCAGCAGCCCAGGGGATTTCAMGAGCTCA 
GACTCAGAGGMCATCTGCGGAGAGACCCCCGMGCCCTCTCCAGGGCAGTCCTCATCCAGACGCTCCGCTAGTGCAGACAGGA 
GCGCGCAGTGGCCCCGGCTCGCCGCGCCATGGAGCGGATCCCCAGCGCGCAACCACCCCCCGCCTGCCTGCCCAAAGCACCGGG 
ACTGGAGCACGGAGACCTACCAGGGATGTACCCTGCCCACATGTACCAAGTGTACAAGTCAAGACGGGGAATAAAGCGGAGCGA 
GGACAGCAAGGAGACCTACAMTTGCCGCACCGGCTCATCGAGAAAMGAGACGTGACCGGATTM 

30 GAAGGATCTCCTACCCGMCATCTCAMCTTACMCTTTGGGTCA 

TGTGAMGCACTMCAAACCTMTTGATCAGCAGCAGCAGAAAATCATTGCCCTGCAGAGTGGTTT 
AGGGAGAMTGTCGAAACAGGTCMGAGATGTTCTGCTCAGGTTTCCAGA^^ 

GCACGAGAACACTCGGGACCTGAAGTCTTCGCAGCTTGTCACCCACCTCCACCGGGTGGTCTCGGAGCTGCTGCAGGG^ 

CTCCAGK3MGCCATCAGACCCAGCTCCCAMGTGATGGA 
35 TGGGAAAMCTGCGTGCCAGTCATCCAGCGGACTTTC^^ 

TGGCTATGGAGGAGAATCGGAGMGGGCGACTTGCGCAGTGAGCAGCCGTGCTTC 

GGGAGAMGGATCGGCGCAATTMGCMGAGTCCGMGMCCCCCCACAAAAMGMCCGGATGCAGCTTO 

CCATTTCACTAGCAGTGACCTGATCAGCTCCCCGTTCCTCGGCCCACACCCAC^ 
. GATCCCACCTTCAGCGACTGCCTACCTGCCCATGCTGGAGAAGTGCTGGTATCCCACCTCAGTGCCAGTGCTATACCCAGGCCT 
40 CAACGCCTCTGCCGCAGCCCTCTCTAGCTTCATGMCCCAGACMGATCTCGGCTC 

TCCCTTGCCAGCTCATCCGTCCGTCGACTCTTCTGTCT^ 

AGACTAMCTCTCTAGGGGATCCTGCTGCTTTGCTTTCCTTCCTCGCTACTTC 

CTGCMGATTGTTGCATTGTGTATACTGAGATMTCTGAGGCATGGAGAGCAGATTC 

GTGTGTATGTGCGTGTGCGTGCACATGTGTGCCTCCGTGTTGGTATAGGACTTTAM 
45 AGGATTGCTTGACATCAGGAGACTTGGGGGGGATTGTAGCAGACGTC 

ACACATCAGCTGGATTTTCAAMGCTTCAMGTCTTGGTCTC 

CAGMGGTACTTTCAAMGAGGGCTTTCCAGGGCTCAGCTCCCMCCAGCTGTT^^ 

CGTGACTCACCAGACGTCGGGGAGAGAGAGCAGTCAGACCGAGCTTTC^ 

CGGTAGTGGTTTGGGGAGGTTTCCGCAGGTCTGCTCCCCACCCCTGCCTCGGMG 
50 CTTTCGTAGTGATTAGCTTACTMGGMCTGAAAATGGGCCCCTTGTACAAGCTGAGCT^ 
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TGGGCTTCTGGCACCTGTTTCT^ 

CCCGAGCGAGCACCCCAMGTGCACAAAGCTGAGTAAAMGCTGCCCCCTTCAMCAGM 
CTAAAACTCCTTTTMCCMGCTTAGCTTC^ 

tcttgcccmcggcatatggagtgtccttattgctaaamggattccgtctccttcamgm 
5 cttgttttcccgatgtgtccagccagctccgcagcagcttttc 
tcttttcctgtttttatttt^ 

mggaamtgtttcagatgtttatttgtatmttacttgattcacacagtgagaa 

mgmtmtttttttttctctagggag 

atctatgagtamtgttmgtagttgttttaamtacttmtaamtm 

10 373 

MAKTYDYLFKLLLIGDSGVGKTCLLFRFSE^^ 

MLVYDITNEKSFDNI K1WIRNIEEHASSDVERMILGNKCDMNDKRQVSKERGEKLAIDYG IKFLETSAKS SANVEEAFFTLARD 
IMTKLNRKMNDSNSAGAGGPVKITENRSKKTSFFRCSLL 

374 

15 ATTGGCGGGCACGCCCCCTCGCCCGCGGCCCCCTCCCCGCCTCTCTCCACCGCCTCCTCTGGCTCCCCGGTCAGAGGGCCGGAG 
CGAGMGATGGCGMGACGTACGATTATCTCTTCMGCTCCTGCTGATC 
CCGCTTCTCAGAGGACGCCTTCMCACCACCTTCATC 
AMG^y^^MGCTTCAGATATGGGACACAGCGGGTCAGGAMGATTC 

GGGCATTATGCTGGTCTATGACATCACAMTGAAAMTCCTTTGACMTATTAAAMTTGGATCAGAM 
20 CTCTTCCGATGTCGAAAGMTGATCCTGGGTMCAM^ 

GCTAGCMTTGACTATGGGATTAMTTCTTGGAGACMGCG 

ACGAGATATMTGACAAMCTCMCAGAAAMTGMTGACAGCMTTCAGCAGGAGCAGGTGGACCAGTGAA 
CCGATCAMGMGACCAGTTTCTTTCGTTGCTCGCTACTTTGATGMCTC 

TTCCTGCTTCTCTGAAAGCACAGGTCACCCAGCCTCAGMTCACACCTCCCGGCTGCTGCTGAGAGCACCACTGAACTTAGACC 
25 TCTCAACACAGTATGCCMGTGGATTCCAGCCTCATGGCCTAGCAAMGMCAGACTCCC 

GTGGAGACACATGCAGGACCTMCTCGTTTTTTCOT 

ACATCAGTGTTAGCCTTTCCCTATTTCAGCACMTCTTAGACTCATATTTGC 

TGTACATGTGGCMTGTTAAMGAGCATTTACAGCAGAGGTTMTATACTAAMTTAMGOT 

CAAATATTACTGCCTTGGTAGCATTTATTTMGGGCTTTTTC^ 
30 GCCCA . 

375 

MPGMFFSMPKELKGTTHSLLDDKMQKRRPKTFGOT 

VFGSFLKSEFSEENIEFWLACEDYKKTESDLLPCKAEEIYKAFVHSDMKQINIDFRTRESTAKKIKAPTPTCFDEAQ 
MEKDSYPRFLKSHIYLNLLNDLQANSLK 

35 376 

GGGAGTTAGACAAMTGCCAGGMTGTTCTTCTCTGCTAACCCAMGGMTTGAMGGM 
AMTGCAAAAAAGGAGGCCAMGACTTTTGGMTGGtf 

TGAMTCTTCCMGTCCMGGATGTACTTTCTGCTGCTGMGTMTGCMTGGTC 

AMCTGGTCAAAATGTCTTTGGMGTTTCCTAMGTCTGMTTCAGT^ 
40 ATAAGAAMCAGAGTCTGATCTTTTGCCCTGTAMGCAGMGAGATATATAMGC^ 

TCAATATTGACOTCCGCACTCGAGMTCTACAGCC 

TCATATATACTCTTATGGAAMGGACTCTTATCCCAGGT^ 
• • CTMTAGCCTAMGTGACTGGTCCCTGGCTGMGGGMTTAACAGATAGTATCAGCGCAGMGGM 

TGGGTGMCAGCTTGGCCTTTTTTGGGTGTC 
45 CCAGGCGCAGAGTTGMGMGCATMGCMGCMGACAAAMCAGAGAGACCGCMGGAGGMGATCTGTGGTACTG 

AMCAGTGGAGCTCTGTATTAGAAMGCCCCTCAGMCTGGGMGGCCAGGTM 

CTATCAMC^ATTACACAGACTGTAAGMTTCAAAGTCMCTGACATCTATGCTACATAT^ 

TGAGCCATTAAOTAMGCATATGTTTCAM^ 
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TTGGAAGAGATGAACTGTGTATTTMCTTM 

GTGCMTMTAMTGTGGATTTTGTATTAAAATATATATGAAGCAAAAAAA 
377 

MQIFVKTLTGKTITLEVEPSDTIENVKA^ 
5 TGKTITLEVEPSDTIENVKAKIQDKEGIPPDQQRLIFAGKQLEDGRTLSDYNIQKESTLHLVLRLRGGMQ 
VEPSDTIEWKAKIQDKEGIPPDQQRLIFAGKQL 

NVKMIQDKEGIPPDQQRLIFAGKQLEDGRTLSDYNIQKESTLHLVLRLRGGMQIFVKTLTGKTITLEVEPSDT 
KEGIPPDQQRLIFAGKQLEIXSRTLSDYNIQKESTLHLVLRLRGGMQIFVKTLTGKTITLEVEPSDTIENVK^ 
QRLIFAGKQLEDGRTLSDYNIQKESTLHLVLRLRGGMQIFWT^ 
10 QLEDGRTLSDYNIQKESTLHLVLRLRGGMQIFVKTLTGKTITLEVE PSDTI ENVKAKIQDKEGI PPDQQRLI FAGKQLEDGRTL 
SDYNIQKESTLHLVLRLRGGMQIFVKTLTGOT 
STLHLVLRLRGGV 

378 

TAGTTCCGTCGCAGCCGGGATTTGGGTCGCGGT^ 
15 AGACTCTGACTGGTMGACCATCACCCTCGAGGTTGAGCCCAGTGACACCATCGAGMTGTCAAGGCAMGA 
MGGCATCCCTCCTGACCAGCAGAGGCTGATCTTTGCTGGAAAACAGC 
AGAMGAGTCCACCCTGCACCTGGTGCTCCGTCTCAGAGGTGGGATGCAM 
CCCTT6AGGTGGAGCCCAGTGACACCATCGAGAACGTCAAAGCAM6ATC 

GGTTGATCTTTGCCGGAMGCAGCTGGMGATGGGCGCACCCTGTCTGACTACMCATCCAGAMGAGT 
20 TGCTCCGTCTCAGAGGTGGGATGCAGATCTTCGTGMGACCCTGACTGGTMGACCATCACCCTC 
CCATCGAGAATGTCMGGCAMGATCCMGATMGGMGGCATTCCTCC 
TGGMGATGGTCGTACCCTGTCTGACTACMCATC 

AMTCTTCGTGMGACACTCACTGGCAAGACCATCACCCTTGAGGTCGAGCCCAGTGACACTATCGAGMCGTCAAAGCAAAGA 
TCCMGACAAGGAAGGCATTCCTCCTGACCAGCAGAGGTTGATCTTTGCCGGAMGCAGCTGGMGATGGGCGCA 
25 ACTACMCATCCAGAMGAGTCTACCCTGCACCTGGTGCTCCGTC 
GTMGACCATCACCCTCGMGTGGAGCCGAGTGA^ 

CTGACCAGCAGAGGTTGATCTTTGCCGGAAMCAGCTGGMGATGGTCGTACCCTGTCTGA 

CCTTGCACCrGGTGCTCCGTCTCAGAGGTG^ 

AGCCGAGTGACACCATTGAGMTGTCMGGCAAAGATCCMGAC^^ 
30 CTGGGAMCAGCTGGMGATGGACGCACCCTGTCTGACTACMCATCCAGAAAGAGTCCACCCTGCACCTGGTGCTCCGTCTTA 

GAGGTCK3GATGCAGATCTTCGTGMGACCCTGACTGGTMGACCATC 

TCMGGCAMGATCCMGACMGGMGGCATCCCTCCTC^ 

GCACCCTGTCTGACTACMCATCCAGAMGAGTCCACCCTGCACCTGGTC 

AGACCCTGACTGGTMGACCATCACTCTCGAAGTGGAGCCGAGTGACACCATTGAGMTGTCAAGG 
35 MGGCATCCCTCCTGACCAGCAGAGGTTGATCTTT^ 

AGAMGAGTCCACCCTGCACCTGGTGCTCCGTCTCAGAGGTGGGATGCAGATCnCGTGMGACCCTGAC 

CCCTCGAGGTG&AGCCCAGTGACACCATCGAGAATGTCMGGCAMGATC^ 

GGTTGATCTTTGCTGGGAMCAGCTGGMGATGGACGCACCCTGTCTGACTAC 

TCCTGCGCTTGAGGGGGGGTGTCTMGTTTCCCCTTTTM 
40 GCATTCCC ■ 

379 . ' 

mgsrtpbsplhavqlrwgprrrpplvpllialvpppprv^ 

TSQPGVLQGGAVYLCPWGASPTQCTPIEFDSKGSRLL 

LSDPVGTC YLSTDNFTRI LEYAPCRSDFSWAAGQGYCQGGF saeftktgrwlggpgsyfwqgqi lsatqeqi ae s yypeylin 
45 LVQGQLQTRQAS S I YIIDSYLGYSVAVGEFSGDDTEDFVAGVPKGNLTYGYVTI LNGSDIRSLYNFSGEQhlAS YFGYAVAATDVN 
GIX3LDDLLVGAPLLMDRTPIX3RPQEVGRVYVYL^^ 
QQC^A/FWPGGPGGLGSKPSQVLQPLWMSHTPDFFGSALR^ 

MFNPEERSC SLEGNPVAC INLSFCLNASGKHVADS IGFTVELQLDWQKQKGGVRRALFLASRQATLTQTLLIQNGAREDCREMK 
IYLRNESEFRDKLSPIHIALNFSLDPQAPVDSHGLRPALHYQSKSRIEDKAQILLIXX5EDNICVPDLQLEVFG 
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ALNLTFHAQNVGEGGAYEAELRVTAPPEAEYSGLVRH PGNFS SLSCDYFAVNQSRIjLVCDLGNPMKAGAS LWGGLRFTVPHLRD 
TKKTIQFDFQILSKl^OTSQSDWSFRLSVEAQAQWLNGVSKPEAVLFPVSDWHPRDQPQKEEDLGPAVHHW 
SQGVLELSCPQALEGQQLLYVTRVTGLNCTTNHPINPKGLELDPEGSLHHQQKREAPSRSSASSGPQILKCPEA 
PLHQQESQSLQLHFRWAKTFLQREHQPFSLQCEATO<ALKMPYRILPRQLPQKERQVATAVQWTKA 
5 GLLLLGLLIYILYKLGFFKRSLPYGTAMEKAQLKPPATSDA 

380 

CAGGACAGGGMGAGCGGGCGCTATGGGGAGCCGGACGCCAGAGTCCCCTCTC 
GCCGACCCCCGCTCGTGCCGCTGCTGTTGCTGCTC^ 
CAGCAGTACTCTCGGGGCCCCCGGGCTOT 
10 TGGTGGGAGCACCCAAGGCTAATACCAGCCAGCCAGGAGTGC^ 
CCACACAGTGCACCCCCATTGMTTTGACAGCAAAGGCTCTC 

CTGTGGAGTACMGTCCTTGCAGTGGTTCGGGGCMCAGTTCGAGCCCATGGCTCCTCCATC 
GCTGGCGCACAGAGMGGAGCCACTGAGCGACCCCGTGGGCACCTC 

ATGCACCCTGCCGCTCAGATTTCAGCTGGGCAGCAGGACAGGGTTACTGCCMGGAGGCTTCAGTC 
15 GCCGTGTGGTTTTAGGTGGACCAGGMGCTAm 

ATTACCCCGAGTACCTGATCMCCTGGTTCAGGGGCAGCTGCAGACTCGCCAGGCCAGTTCCATCTATGATGACA 

GATACTCTGTGGCTGTTGGTGAATTCAGTGGTGATGACACAGMGACT^ 

GCTATGTCACCATCCTTMTGGCTCAGACATTCGATCCCTCTACMCTTCTCAGGGGMCAGATG^ 

CAGTGGCCGCCACAGACGTCMTGGGGACGGGCTGGATGACTTGCTGGTGG^ 
20 GGCGGCCTCAGGAGGTGGGCAGGGTCTACGTCTACCTGCAGCACCCAGCCGGCATAGAGCCCACGCCCACCCTTACCCTCACTG 

GCCATGATGAGTTTGGCCGATTTGGCAGCTCCTTGACCCCCCTGGGGGACCTGGACCAGG 

GGGCTCCCTTTGGTGGGGAGACCCAGCAGGGAGTAGTGTTTGTATT^ 

AGGTTCTGCAGCCCCTGTGGGCAGCCAGCCACACCCCAGACTTCTTTGGCTCTGCCCTTCGAGGAGGCCGAGACCTGGATGGCA 
ATGGATATCCTGATCTGATTGTGGGGTCCTTTGGTGTGGACMGGCTGTGGTATACAGGGGCCG^ 
25 CCTCCCTCACCATCTTCCCCGCCATGTTCMCCCAGAGGAGCGGAGCTGCAGCTTAGAGGGGAACCCTGTGGCCTGCATCAACC 
TTAGCTTCTGCCTCMTGCTTCTGGAAMCACGTTGCTC 

AGMGGGAGGGGTACGGCGGGCACTGTTCCTGGCCTCCAGGCAGGCAACCCTGACCCAGACCCTGCTCATCCAGAATGGGGCTC 
GAGAGGATTGCAGAGAGATGMGATCTACCTCAGGMCGAGTCAGMTTTCGAGACAAACTCTCGCCGATTCACATCGCTCTCA 
ACTTCTCCTTGGACCCCCAAGCCCCAGTGGACAGCCACGGCCTCAGGCCAGCCCTACATTATCAGAGCAAGAGCCGGATAGAGG 
30 ACMGGCTCAGATCTTGCTGGACTGTGGAGMGACMCATC^ 
ATGTGTACCTGGGTGACAAGMTGCCCTGAACCTCACTTT^ 

TTCGGGTCACCGCCCCTCCAGAGGCTGAGTACTCAGGACTCGTCAGACACCCAGGGAACTTCTCCAGCCTGAGCTGTGACTACT 
TTGCCGTGAACCAGAGCCGCCTGCTGGTGTGTGACCTGC^CAACCCCA 

TTACAGTCCCTCATCTCCGGGACACTMGAAMCCATCCAGTTTGACTTCCAGATCCTCAGCMGMTCTCAACAACTCGCAAA 
35 GCGACGTGGTTTCCTTTCGGCTCTCCGTGGAGOT ' 

TCCCAGTMGCGACTGGCATCCCCGAGACCAGCCTCAGMGGAGGAGGACCTGGGACCTGCTGTCCACCATGTCTATGAGCTCA 

TCMCCMGGCCCCAGCTCCATTAGCCAGGGTGTGCTGGMCTCAGCTGTCCCCAGGCTCTGGMGGTCAGCAGCTCCTATATG 

TGACCAGAGTTACGGGACTCMCTGCACCACCMTCACCCCATTAACCCAAAGGGCCTGGAGTTGGATCCCGAGGGTTCCCTGC 

ACCACCAGCAAAAACGGGAAGCTCCMGCCGCAGCTCTO 
40 TCAGGCTGCGCTGTGAGCTCGGGCCCCTGCACCMCMGAGAGCCAMGTCTGCAGTTGCATTTC 

TCTTGCAGCGGGAGCACCAGCCATTTAGCCTGCAGTC 

GGCAGCTGCCCCAAAMGAGCGTCAGGTGGCCACAGC^^ 

TCATCATCCTAGCCATCCTGTTTGGCCTO^ 

CCCTCCCATATGGCACCGCCATGGAAAMGCTCAGCTCMGCCTCCAGCCACCTCTGATGCCTGAGTO 
45 CCCATTCCTCMGAACCAGTCCCCCCACCCTCAT^ 

GGAGMGCTCCTCTCCCCAGCCCAGAGACATACTTGMGGGCCAGAGCCAGGGGGGTGAGGAGCTGGGGATCCCTCCCCCCCA^ 

GCACTGTGAAGGACCCTTGTTTACACATACCCTCTTCATGGATGGGGGMCTCAGATC 

AGCCTTTGCATTTTGGAGAGTTTCCTGAMCAACTGGAMGATM 

ACMGGACTTCCTGTCCAGCTCCAACCTGCAAAGATCTGTCCTCAGCCTTGCCAGAGATC 
50 TGGMCTTGGGGAGTTMGACCTGGCAGCTCTGGACAGCCCCACCCTGGTGGGC^ 

CCAGGACCAGCTCAGGACAGATGCCACMGGATAGATGCTGGCCCAGGGCCAGAGCCCAGCTCCAAGGGGAATCAGAACTCAAA ' 
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TGGGGCCAGATCCAGCCTGGGGTCTGGAGTTGATCTGGAACC^ 

tatttgggcctcx:tccagacctgatcctggaggcccagttcac 
ggccatgatggcmcagatctggaacctcagcctggccagacacaggccctccctgttc 
ctgggcctgcagmtttgggttctgcctgccagc^ 
5 caccagacacccaggacttatttamctctgttgcmgtgcm 

AAAA 
381 

MDHTEGLPAEEPPAMPSPGKFGERPPPmTREAMRNYLK^^ 

EQMERDGCSEQESQPCAFIGIGNSDQEMQQLNLEGKNYCTAKTLYISDSDKRKHFIFSVKMFYGNSDDIGW 
10 KKKQSLKNADLCIASGTKVALFNRLRSQTVSTRYLHVEGGNFHASSQQWGAFFIHLLDD 
SVTGMLPRLIIMKVDKHTALLDADDPVSQLH^^ 

EYTFYEGMGPVLAPWPVPV\^SLQLNGGGDVAMLELTGQNFTPNLRVWFGDVEAETMYRCGESMLCW 
PVQVPmVRTO&IIYSTSLTFTYTPEPGPR 

382 ..... 

15 atcccctccggttttcctcagtctccacgtacgtccctcamgcgcgtcctaamcccggataaccggagcgctccccatggac 

cacacggagggcttgcccgcggaggagccgcctgcgcatgctcca 

cttactagggmgctatgcgamttatttaaaagagcgaggggatcamcagtacttattcttcatgc 

tcatatggamtgaaaamggtttttttgcccacctccttgtgtatatcttatgggc 

atggmcgcgatggttgttctgmcmgagtctcmccgtgtgcamattgggatagg 
20 ctamcitcgmggamgaactattgcacagccaamcattc^ 

gtamgatgttctatggcmcagtgatgacattggtgtgttcctcagcmgcggataamgtc^ 

mgcagtcattgaaamtgctgacttatgcattc^ 

agtaccagatacttgcatgtagmggaggtmttto ' 

gatgatgmtcagmggagmgmttcacagtccgagatgtctacat 
25 actggcatggcactcccmgattgatmttatgamgttgatmgcataccgca™ 

ctccatamtgtgcattttaccttmggatacagaaagmtgtatttgtgcctttctc • 

actccatgtccaaaagmccamtamgagatgatamtgatggccx:ttcctggacmtc^ 

acattttatgagggaatgggccctgtccttgccccagtcactcctgtgcctgtggtagaga 

gacgtagcmtgcttgmcttacaggacagmtttca 
30 tacaggtgtggagagagtatgctctgtgtcgtc^ 

caggttccagtaactttggtccgamtgatggm^ 

cggccacattgcagtgtagc^ggagcmtccttccagccmttcmgccaggtgccccctm 
ggaagttacacamcgccagcacamttcaaccagtgtcacatcatctacagccacagtggtatccta 

;383 

35 MEESEPERKRARTDEVLPEEAAPRRKMRTTRTTCP 

SLLDQHQHLKEKAEARKESMEKQLKEEEKILESVAEGRALM^ 

i.VEGDGIPPPIKSFKElOTPAAIIiRGLKKKGIHHPTPIQIQGIPTILSGRDMIGIAFTGSGKTLVFTLPVIMFCLEQEKRLPFS 
KREGPYGLIICPSRELARQTHGILEYYCRLLQEDSSPLLRCALCI 

CRYLALDEMRMIDMGFEGDIRTIFSYFKGQRQTLLFSATMPKKIQNFMSALVKPVTINVGRAGMSLDVIQ 
40 WLLECLQKTPPPVLIFAEKKADVDAIHEYLLLKGVEAVAIHGGKDQEERTKAIEAFREGKKDVLV 
NYDMPEEIENYVHRIGRTGRSGNTGIATTFINKACDESVLMDLKALLLEAKQ 
HRITDCPKLEAMQTKQVSNIGRKDYLAHSSMDF 

384 

AATGGAGGAGTCGGAACCCGAACGGMGCGGGCTCGCACCGACGAGGTGCTGCCGGAGGAAGCCG 
45 GACGACGAGGACTACGTGCCCTATGTGCCGTTACGCAGCGCCGCAGCTAC 

AGCTGCGGAGGAAGAGCAGCAGGACAGCGGTAGTGMCCCCGGGGAGATGAGGACGACATCCCGCTAGGCCCTCAGTCCM 

CAGCCTCCTGGATCAGCACCAGCACCTTAMGAGAAGGC^^ 

AGAGMGATCCTGGAGAGTGTTGCCGAGGGCCGAGCATTGATGTC 
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catcaamccagctggactccaccccg™ 
cctggtgk5agggagacggtatcccaccacccatcmgag 
gaagamggcattcaccacccaacacccattcagatccagggcatccccaccattctatc 
tttcacgggttcaggcmgacactggtgttcacgttgcccgtcatcatgtt 
5 amgcgcgaggggccctatggactcatcatctgcccctcgcgggagctggcccggcagacccatggcatcctg 

ccgcctgctgcaggaggacagctcaccactcctgcgctgcgccctctgcattgggggcatgtccgtgaaagagcagatggagac 
catccgacacggtgtacacatgatggtggccaccccggggcgcctcatggatttgctgcagmgm 
ctgtcgctacctggccctggacgaggctgaccgcatgatcgacatgggcttcgagggtgacatccgtaccatct^ 
cmgggccagcgacagaccctgctcttcagtgccaccatgccgmgmgattcagaactttgct 
10 tgtgaccatcmtgtggggcgcgctggggctgccag 

ggtgtacctgctcgagtgcctgcagaagacacccccgcctgtactcatctttc^ 
cgagtacctgctgctcaagggggttgaggccgtagccatccatgggggcaaagaccaggaggmcggactm 

ATTCCGGGAGGGCMGMGGATGTCCTAGTAGCCACAGACGTTGCCTCCMGGGCCTGGACTTCCCTGCCA^ 

CAATTATGACATGCCAGAGGAGATTGAGAACTATGTACACCGGATTGGCCGC^ 
15 TACCTTeATCAACAAAGCGTGTGATGAGTCAGTGCTGATGGACCTCAAAGCGCTGCTGCTAGMGCCM 

CGTGCTGCAGGTGCTGCATTGCGGGGATGAGTCCATGCTGGACATTO 

TCATCGGATCACTGACTGCCCCAMCTCGAGGCTATGCAGACCMGCAGGTCAGCMCATCG^ 

CAGCTCCATGGACTTCTGAGCCGACAGTCTTCCCTTCTCTCCMGAG 

AGCAGCCCCCTGGACAGAATCAGCATTTCAGCTCAGCTGGCCTGGMTG 
20 CTTCAGMTTACTGTTTTTGTTTCCTTTTAC^ 

385 

MGKGDPNKPRGKMSSYAFFVQTCREEHKKKHPDSSVNFAEPSKKCSERMTMSAKEKS 

KKGKKKDPNAPKRPPSAFFLFCSEHRPKIKSEHPGLSIGDTMK^ 

EAGKKGPGRPTGSKKKNEPEDEEEEEEEEDEDEEEEDEDEE 

25 386 

gatgtggcccgtggcctagctcgtcmgttgccgtggcgcggagmctctgcaa^ 

aaccagttcacgccggagccccgtgagggaagcgtctccgttgggtccggccgctctgcgggactctgaggaaaagctcgcacc 

aggtggacgcggatctgtcaacatgggtamggagaccccmcmgccgcggggc^aaatc 

gacctgccgggmgagcacmgmgamcacccggactcttccgtcmtttcgcggmto 
30 gmgaccatgtctgcamggagmgtcgmgtttgmgatatc^ 

ttacgttcctcccamggtgatmgaaggggaagaaamggaccccaatgctcctaamggccaccatctgccto 

ttgctctgmcatcgcccamgatcaamgtgmcaccctggcctatccattggggatactgcamgaaa 
. gtctgagcagtcagccamgatamcmccatatgmcagamgcagctmgctamggagamtatgaam 

atatcgtgccmgggcaamgtgaagcaggaaagaagggccctggcaggccaacaggctcaaagaagaagaacgaaccagaaga 
35 tgaggaggaggaggaggmgaagaagatgmgatgaggaggmgaggatgmgatgmgmtaaatggctatc 

cgtgtggmtgtgtgtgtgtgctcaggcaattattt 

amctgtacagatttttgtatagctgatmgattctctgtagagaaaatacttttaaaam 
gctactcatacagttagattttacagcttctgatgttgmtgttcctamtatttmtg^ 
tagcacagcamcttgtaggmttagtatcmtagtamttttgggttttttaggatgttgc 
40 tttgtmtaamttatgtatattatttctattc 
gaccaaaaaaaaaaaaa 

387 

MRSRLLLSVAHLPTIRETTEEMLLGGPGQEPPPS 

ITARFSGQQPTLPMADTVDPLDWLFGESQEKQPSQRDLPRRTGPSAGLWGPHRQMDSSKPMGAPRGRI^ 
45 GQQSSDLRSWTFGQSAQAMASRHRPRPSSVLRTLYSHLPVIHEL 

388 

CACACTGCTCAGGGAAGAGCCTGCTACGGTGGACTGTGAGACTCAGTGC^ 
CTGCCGGACCCTCCACCCTTCGGCCCCCMGCTTCCCAGGGGC 
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CACCCCCAGACCTCCTCAGCTCCAGGTTGCCACCTCCTCTCGCCAGAGTG 
TGCCCACMTTCGGGAGACCACGGAGGAGATGCTGCTTGGGGGTCCTGGACAGGAGCCC 
ACGTGAGGTCTATATCTCGACTGGCACAGCCCACCTCTGTGCTGGACMGGCCACGGCCCAGGGCCMCC 
GGCC^GCCCAGGCCTGCCGGMGGGCCGCCCTGCTGTGTCCCTGCGAGACATCACCGCACGTTTCAGT^ 
5 TGCCCATGGCTGATACTGTGGACCCCCTGGACTC^CTTTTTGGGGAGTC^ 
GGAGGACTGGCCCCTCTGCTGGCCTCTGGGGTCCACA 

TCTGTGAAGCCAGGATGCCTGGGCATTCCCTGGCMGACCACCGCAGGATGGGCAGCAGAGCTCTC^ 
TTGGGCAGTCTGCCCAAGCCATGGCCTCCCGCCACCGCCCCCGCCCCAGCAGTGT^^ 
TGATCCATGAACTCTGACCCCTCCCCAGTAMGGCTTCTGTAGAGAGCATGCTGGGTCTGCATCTC 
10 TGGTCACTGCCCCTGGCAGGTCTCTGAMGGGAMTGCTTTTCTGCA 

CCCTCTGMTATGATAACAGCCTGTTTCACATGAGGAGATGTTACCMTCCCGTTCGCTCTGACC 
GAGCMCTTACTTMCATCTGTGTTCCTCAGTTTCT^ 

TGACGTGTACCGCATATAGGAGCCCACTGGCTGAGTAGCTACCATCATCGCTGGTGGGGAMC 
AGTGGGGGTGTCAGCCCCCAGGTGTTTCAGMCMGGCCTC^ 
15 ATGTTCTGTGAGGCCCAAGAGMCACATGGAGTCTTAGCAAATC 

GGCACTCTGCTGGCTACMCTCATACGTCCTGTGGTGGCATTGGGAGAGTTCC 

CACTCAGTGCTACCATCCCCACCCCTACACCAOTCCACACAGGGGCCTCATGGCATGGTCAGGGTCCCAGCTGTG^ 
CAGGGCACTGTCCAGCTGTCCACTGGGGMGTCMGATGTCCTMGGCCCAGGCCAGGGCATCTGGAGTCTGAAGGA 
TCCTAGAGGCATCTGGCAGCMGMGGTGAGGCATCAGGGMCGGGMTCAGGCTGGGACTGATCAGAGGTGMGGGACAG 
20 GAGGAGAGGAGGMGATTGAGCTGGGGCMCAGCCMGCTCACCTGGCAGGTCTCTGCCACCTCCTTCTCTGTGAGCTGTCAGT 
CTAGGTTATTCTCTTTTTTTGTGGCTATTTTTAATTGCTTTG 
CTGGAGATAGMTGTAMCCATATTAAMGGAAAMGTTTCAGACM 
TCAGCMTGTTCTTCACACCCTGCMGGGCTGGGMTTGCGCATGTGMCTTC 
TTTGTATCTGTCAC 

25 389 

MQSLMQAPLLIALGLLLATPAQAHLKKPSQLSSFSWDNCDEGKDPAVIRSLTLEPDPIWPGNVTLSWGSTSVPLSSPLKVDL 
v VLEKEVAGLWIKIPCTDYIGSCTFEHFCDVLDMLIPTGEPCPEPLRTYGLPCHCPFKEGTYSLPKSEFWPDLELPSWLTTG^ 
RIESVLSSSGKRIiGCIKIAASLKGI 

390 

30 CTCAGCTTCTTTGCGTMCCMTACTGGMGGCATTTAMGGACCTCTGCCGCCTCAGACCTTG 
CACCCTTCCCGATGCAGTCCCTGATGCAGGCTCCCCTCCTGATCGCCCTGGGCTTGCTTCTC 
TGAAAMGCCATCCCAGCTCAGTAGCTTTTCCTGGGATMCTGTC^ 

TGGAGCCTGACCCCATCGTCGTTCCTGGAMTGTGACCCTCAGTGTCGTGGGCAGCACCAGTGTCCCCCTGA 
AGGTGGATTTAGTTTTGGAGAAGGAGGTGGCTGGCCTCTGGATCMGATCCC^ 
35 MCACTTCTGTGATGTGCTTGACATGTTAATTCCTACTC 

ACTGTCCCTTCAAAGMGGMCCTACTCACTGCCCMGAGCGMTTCGmTGCCTGACCTGG 
CCGGGAACTACCGCATAGAGAGCGTCCTGAGCAGCAGTGGGMGCGTCTGGGCTGCATCMGATCGCTGCCTC 

CCCACTCTCTGCCCCCCTTTMTCCCCTTTCTACAGTGAGTC^ 
40 GGCTGGGGCMGCAGCCCTGACCTMGGGAGMTX3AGTTGGACAGTTCTTGATAGCCCAGGGCATCTC 
ACTCATCCCCGTTAACATTCTCTCTAMGAGCCTCGTTCATTTCCAMGCAGTTMGGAAT^ 

TTCTCCTCCGGCCCCCTGTTACMTGMGGGGCAAMGTATTTGCTCTTAG 
TATTTCCTTTCTAGATTTGCCCMTTMTACTAGGGTGCAGTC 
45 GGGAGATATTAGGAGTGCTCTGTTGTTTAC^ 

TGAGAAACATGTTGCTTCCAGGCTTGATTTCGATTT^ 

GTGCAGTGGTGCAATCTCACCTCACTGCAACCTCCGCCTCCTGGGTTCMGCMTTCTCC 
GACTACAGGCCCTGCCACCACGCCCGGCTMTTTC^^ 
GATCTCTTGACCTCGTGATCCGTCCACCTTGGCCT^ 
50 CCTirTTGATTAMGATGOTATTACAATGTAAATATTTCTTA 
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TGAGGCCTGGTCTCCAGTTGGAAATATMTTMGGGTGGCAGGGACTGGAGTC 
CMCTGCCAGATACTGGCMTACTCCAGCCTGGTGACAGAGTGAGACTCTGTCTCAAAA^ 
AGMGCCAAAAATCCAGATTTGTATATGAMTCTTACCATTTTAAAAGATTGGCAGCTMTTATTT^ 
TGTGATGTGTCCCAMCGGACTGGCTCATGGGTGGCCACGTCACAACCTC 
5 ACAACTCCTGTGGCACCGTTTCTCCCTCCACAGGGCCAMGCCATAGTC^ 
AGAGGTATGAGCAGCCTCTCACCTGTGAGCTGTGGGGATCAC 
GGCAGATGGGAMGAGCCTCACCAGCAGCTGCTTTTGGAGCAGGGGTCCM 
GGATTTTTCTACCACCCTGTTACATCATMCMCTTCTGAMCAC 
TAGCMTAMTCTTTTTAACTTGCAAAAAAAAAAAAAAAAAA 

10 391 

MPAALVENSQVICEWASNLEEEMRKIREIVPSYSYIAM^ 
KGEYPSGINTWQFNFKFNLTEDMYSQDSID^ 

LTDSRLPEEEHEFLHILNLFSPSIYDVKYLMKSCKNLKGGLQEVADQLDLQRIGRQHQAGSDSLLTGMAFFRMKELFFEDSIDD 
AKYCGRLYGLGTGVAQKQNEDVDSAQEKMSILAIINNMQQ 

15 392 

GCACGAGGGGGTAGAGGGAAMGAGCTCCGGGCCAGGGGCTGC(^TCGCCGCCGTCGGGGAGTCAGCCCGCCAGCCCGCC^ 
CGTCAGCCCGCCACCAGCTTCGCGGGCCCTGTCGGTCCCGGTAAGCGGGCCTGCGCTTACCGGAMGAGGAGCGTMGATGA^ 
GAGTATCAGACCAMCATTGTCTGGCTTGCACTGTAAMCTAGTTAGCTGMGACGAC1TC 
AGCACTTGTGGAGMTAGCCAGGTTATCTGTGMGTGTGGGC 
20 GCCCAGmCAGTTATATTGCCATGGACACAGMTTTC^^ 

TTACCAATATCAGCTTCTGCGGTGCMTGTOACCTTTTAAAMTTATCCAGCTC 
GTATCCTTCTGGAATCMTACTTGGCAGTTCMm 

CCTTGCTMCTCAGGACTACAGTTTCAGMGCATGMGAGGAAGGGATTGACACACTGC 
AGGAGTGGTTCTCTGTGACMTGTCAAATGGCTTTCATTTC 
25 TTCTCGTTTGCCAGMGAGGMCATGAATTCTTACATATTCTGAAC 

GMGAGCTGCAAAMTCTTMGGGAGGTCTTCAGGMGTTGCTGATCAGTTGGATTTGCA^ 

AGGCTCAGACTCACTGCTGACAGGMTGGCTTTCTTTAGGATGAAAGAGTTGT^ 

CTGTGGGCGGCTCTATGGCTTACXKACAGGAGTG^ 

CCTGGCGATTATCMCMCATGCAGCAGTGATGGCGCCAGGCTCTGCAGGGTGGGCCTG 
30 GACTGTGTACTTATCTTCCCCMGAGAAMTGCTTCTTTTGAGCACACTGTACCTA^ 
TTTTACTGMGAO^AAAGATGTTTTTATTTTAGACC^^ 

CTCATACTCGAGCCTCTCCTCTCTGGTTGCCTCCTGCCACCAGCATCCATGGCTCATTTG 

GTCTGAMCAMGTAGTAAMTGTATATAACTCTTACCTGTTGTCATTCT^ 

GAAGATTCTTACTCTGATTCTCAGCTGAGCTGTGAGGGCTTCCAGGGAAMTGGMCAA 
35 TAGATACTGAGTCTTCCTTTCCTTTTCTGACCCTTC 

TACTATATGGAMTGMTTGCTCTGTGCTGAAATTTGM 

TCTACCTTCATTCATGAGMCTCCAGMTGAGTGTTGACCAC 

TTGCTGTCACATATGCATCATCTGAMTCATTTGAM 

AAGATCAGATTTTTGGGTCATGTCTGTTGTATTTTCAGTMTGTC 
40 CAmTTTCTCTACTTTTCCTTCCAGCAMCCTTGAM 

MGCTAGAGTGGTTACATTTMTCAGGCAGTMGATMTTTGG^ 

GGMGCTCAGGACTTTTTTCTTTAMGCTAGTCATT^ 

AAAAACCAAAACCCACTTTGGATTGGTGGMGTAAAAACTGGTAACTCA 

TATGGGAAACTCMTTTGAMTGATTAGAAMTGTCMGTATTATMG 
45 TTTMTTTTGTAAMTAMGATTTCTTTOT 

394 

TCATCTTTCTCTTTATTCTGACMCATTATGACATTAMTAAAATC 

ATTTAGATMGGCAGAACTGAGCCCCTCGGAATGTATTATCTCA 

AGTCAMTCCTGGGCATCTCCACAGATGTGTGGTAGGGCAC 
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TTTATTTTGTAGCGCCAAGGTCra 
TAGMGAMGTGGGAAAGGTGMGGACAGGGATCCTTO 
GGGCCAAAGMGGTAGCAGCTACTTGGGGTTTTGTGTGGGAGATGGCCGACCACTC 
ACGTGGAATCTCGTCCAAAAAATGG 

5 395 

MKV SAALLCLLLI AATFI PQGLAQPDAINAPVTCC YNFTNRK I SVQRLAS YRRITS SKCPKEAVIFKTIVAKEICADPKQKWVQ 
DSMDHLDKQTQTPKT 

396 

CTMCCCAGAAACATCCMTTCTCAMCTGMGCTCGCACTCTCGCCTCCAGCATGAAAGTCTCTGC 
10 TGCTCATAGCAGCCACCTTCATTCCCCMGGGCTCGCTCAGCCAGATGCM 

CCMTAGGMGATCTCAGTGCAGAGGCTCGCGAGCTATAGMGAATCACCAGCA 

AGACCATTGTGGCCMGGAGATCTGTGCTGACCCCMGCAGMGTGGGTTCAGGATTCCATGGACCACCTGG 
AMCTCCGMGACTTGMCACTCACTCCACAACCCMGMTCTGCAGCTMCTTATTTTCCCCTAGCTTTC 
TTTTATTTTATTATMTGMTTTTGTTTGTTG 
15 TTTTMGTTTATCTTTCATGGTACTAGTGTTTTTTAGATACAGAGAC™ 
CCCCTGGGATGTTTTGAGGGTCTTTGCAAGAA^ 

TTGTGGAMTGMTATTTTGTMCTATTACACCAMTAMTATATTTTTGTACAAAAAAAAAA71AAA 
397 

• ^YLLMFSLLFVACQGAPETAVLGAELSAVGENGGEKPT 
20 LGSPRSKRALENLLPTKATDREMCQCASQKDKKCWNFCQA 

SSEEHLRQTRSETMRNSVKSSFHDPKLKGKPSRERYVTHNRAHW 

398 m ' 

ggagctgtttacccccactctaataggggttcmtataaamgccggcagagagctgtccmgtcagacgcgcctctgcatctc 
cgccaggcgmcgggtcctgcgcctcctckagtcccagctctccaccaccgccgcgtgcgcct^^ 
25 ttctctcctggcaggcgctgccttttctccccg 
cgctttgagggacctgmgctgtttttcttot 
atggattatttgctcatgattttctctctot 

agcgcggtgggtgagmcggcggggagamcccactcccagtccaccctggcggctccgccggtccmgcgctgctcctgctc 
tccctgatggatamgagtgtgtctacttctgccacctggacatcatttgggtcaacactcccgagcacgt 

30 cttggmgccctaggtccmgagagccttggagmtttacttcccacam 

agccaaamgacmgmgtgctggmtttttgccmgcaggaamgmctcagggctgmga 
aatcatmgaaaggaamgactgttccmgcttgggaaamgtgtatct 
agttcagaggmcacctmgacaaaccaggtcggagaccatgagamcagcgtcaaatcatc 
ggcmgccctccagagagcgttatgtgacccacmccgagcacattggtgacagacttcggggcctgtctgmgccatag 

35 cacggagagccctgtggccgactctgcaotctccaccctggctgggatcagagcaggagcatcctctgctggttcctgactggc 
amggaccagcgtcctcgttcaaaacattccmgamggttmggagttcccccmccatcttc 
actgctttggtctcttctttcatctggggatgacmtggacctctcagcagam 

400 

GTGTTGGMTCMTCAATGTCCATTTCAGGMGCTTCTTC 
40 TGGGGCCTGGGTTGTCTCCTTGTCCATCCACT^ 

TTTGGGCTTTGAAATTATAGGGCTACAAGTACTTGTCAGCTCCTT 
CATGGTCAAACTCTGCTTGTTCTGCAGATTTAGCTTGTTATTCATC 
CATGTCAGCAGCATCCAMTGATCATACTGGTCCCCCTGTTTTTGAAAGAGC 
CCTAGTTCT 



45 401 
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400 



MAMVSEFLKQAWF I ENEEQEYVQTVKS SKGG PGSAVS PYPTFNPS SDVAALHKAIMVKGVDEATI I DILTKRNNAQRQQI KAAY 

LQETGKPLDETLKKALTGHLEEVVLALLKTPAQFDADE 

TSDTSGDFRNALLSLAKGDRSEDFGVNEDLADSDARALYEAGE 

DLELKGDIEKCLTAIVKCATSKPAFFAEKLHQAMKGVGTRHKALIRIMVSRSEIDMNDIKAFYQKMYGISLC 
5 KILVALCGGN 

402 

AGTGTGAMTCTTCAGAGAAGAATTTCTCTTTAGTT^ 
TATCAGMTTCCTCMGCAGGCCTGGTTTATTGA 

CCGGATCAGCGGTGAGCCCCTATCCTACCTTCMTCCATCCTCGGATGTCX3CTGCCTTG 
1 0 TGGATGMGCMCCATCATTGACATTCTMCTMGCGAMCMTGCACAGCGTC 

CAGGAMGCCCCTGGATGAMCACTTMGAMGCCCTTACAGGTCACCTTGAGGAGGTTGTTTO 
CGCMTTTGATGCTGATGAACTTCGTGCTGC^ 

GMCTMCAMGAMTCAGAGACATTMCAGGGTCTACAGAGAGGMCTGMGAGAGATCT^CC 
CATCTGGAGATTTTCGGAACGCTTTCCTTTCTC 
15 ATTCAGATGCCAGGGCCTTGTATGMGCAGGAGAAAGGAGAMGGGGACAGACGTAAACGTGTTCMTACCATCCTTACCACCA 
GMGCTATCCACMCTTCGCAGAGTGTTTCAGAMTACACCMGTACAGTMGCATGACATGMCAM 
TGAAAGGTGACATTGAGAAATGO^^ 

MGCCATGAAAGGTGTTGGMCTCGCCATMGGCATTGATCAGGATTATGGTTTCCCGTTCTGAMTTGACATG 
MGCATTCTATCAGMGATGTATC&TATCTCCC 
20 TGGCTCTTTGTGGAGGAMCTAMCATTCCCTTGATGGT 

AAGCTTAMTAGGAMGTTTCTTCMCAGGATTACAGTGTA^ 

ATTATMCTCTGTATMTAGAGATMGTCCATTTTTTA^ 

AACAATACATGAGAAAGATGTCTATGTAGCTGAAAATAAAATGACGTCACAAGAC 

403 

25 MPUQVSFTNRNYDLDYDSVQPYFYCDEEENFYQQQQQSEI^ 

DNDGGGGSFSTADQLEMVTELLGGDMVNQSFICDPDDETFIKNIIIQDCMWSGFSAAMLVSEKLASYQMRKDS 
SVCSTSSLYLQDLSAMSECIDPSWFPYPLTOSSSPKSCASQDSSAFSPSSDSLLSSTESSPQGSPEPLVLHEETPPTTSSDS 
EEEQEDEEEIDWSVEKRQAPGKRSESGSPSAGGHSKPPHSPLVLKRCHVSTHQHNYMPPSTRKDYPMKRVKLDSVRVLRQI 
SNNRKCTSPRSSDTEENVKRRTHNVLERQRRNELKRSFFALRDQI PELENNEKAPKWILKKATAYI LSVQAEEQKLI SEEDLL 

30 RKRREQLKHKLSQLRNSCA " 

404 

CTGCTCGCGGCCGCCACCGCCGGGCCCCGGCCGTCCCTGGCTCCCCTCCTGCCTCG 

CGGGAAAAMGMCGGAGGGAGGGATCGCGCTGAGTATAAMGCCGGTTTTCGGGGCTTTATCTMCTCGCTGTAGT 

GCGAGAGGCAGAGGGAGCGAGCGGGCGGCCGGCTAGGGTGGAAGAGCCGGGCGAGCAGAGCTGCGCTGCGGGCGTCCTGGGAAG 
35 GGAGATCCGGAGCGMTAGGGGGCTTCGCCTCTGGCCCAGCCCTCCCGCTTGATC^ 

GCATCCACGAAACTTTGCCCATAC^AGCGGGCGGGCACTTTGCACTGGMCTTACMCA^ 

ACGCGGGGAGGCTATTCTGCCCATTTGGGGACACTTCCCCGCCGCTGCCAGGACCCGCTTCTCTC 

GCTTAGACGCTGGATTTTTTTCGC^TAGTGGAAMCCAGCAGCCT^ 

MCTATGACCTCGACTACGACTCGGTGCAGCCGTAT^ 
40 GAGCTGCAGCCCCCGGCGCCCAGCGAGGATATCTGGMGAMTTCGAGCTCCT^ 

TCCGGGCTCTGCTCGCCCTCCTACGTTGCGGTCACACCCTO 

ACGGCCGACCAGCTGGAGATGGTGACCGAGCTGCTGGGAGGAGACA 

ACCTTCATCAAAMCATCATCATCCAGGACTGTATGTGGAGCGGCTTCTCGGCCGCCGCCM 

TCCTACCAGGCTGCGCGCAMGACAGCGGCAGCCCGMCtt^ 
45 CAGGATCTGAGCGCCGCCGCCTCAGAGTGCATCGACCCCTCGGTGGTCTTCCCCTACCCTC 

TCCTGCGCCTCGCMGACTCCAGCGCCTTCTCTCCGTCCTCGGATTCTCTC 

CCCGAGCCCCTGGTGCTCCATGAGGAGACACCGCCCACCACCAGCAGCGACTCTGAGGAGGMCMGMGATGAG 
GATGTTGTTTCTGTGGAAMGAGGCAGGCTCCTGGCAAMGGTCAGAGTCTGGATCAC 
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CCTCACAGCCCACTGGTCCTCMGAGGTGCCACGTCTCCACACATCAGCACMCTACGCAGCGCCTCCCTCCACTC 
TATCCTGCTGCCAAGAGGGTCAAGTTGGACAGTGTCAGA 

TCCTCGGACACCGAGGAGAATGTCMGAGGCGMCACACMCGTCTTGGAGCGCCAGAGGAGGMCGAGCTAm 
TTTGCCCTGCGTGACCAGATCCCGGAGTTGGAAMCMTGAAMGGCCCC 
5 ATCCTGTCCGTCCMGCAGAGGAGCAAMGCTCA^ 

CTTGMCAGCTACGGMCTCTTGTGCGTMGGAAAAGTMGGAAMCGATTCCT^ 
TGMCTTGTTTCAMTGCATGATCAMTGCMCCTC^ 
CTGCCTCAMTTGGACTTTGGGCATAAMGMCTTTTT^ 
ATTGTTTTTAAAAAATTTTAA 

10 405 

mgkmllhpslvllllvllptdasvsgkpqymvlvpsllhtettekgcvllsylnetvw 

hcvafavpksssneevmfltvqvkgptqefkkrttvmvknedslvpvqtdksiykpgq 

dpkgmiaqwqsfqlegglkqfsfplssepfqgsykwvqkksggrteh^ 

lytygkpvpghvwsicrkysdasix:hgedsqafcekfsgql^ 
15 grqsseitrtitklsfvkvdshfrqgipffgqvrlvdgkgvpipn^ 

vrvnykdrspcygyqwseeheeahhtaylvfspsksfwlepmshelpcghtqtvqahyilnggtlixslkklsf^ 

iwtgthgllvkqedmkghfsisipvksdiapvarlliyavlptgdvigdsakydvenclankvdls 

mpqsvcalravdqsvllmpdaelsassvynllpepcdltgfpgplndqddedcinrhnvyingitytpvss 

mglkaftnskirkpkmcpqlqqyemhgpeglrvgfyesdvmgrgharlvhveephtetvrkyfpetwito^ 
20 tvpdtitewkagafclsedagl^isstaslrafqpffveltmpysvirgeaftlkatvlmlpkcirvsvqleaspaf 

keqaphc icangrqtvswaotpkslgnwtvsae 

gevseelslklppnvveesarasvsvlgdilgsam^^ 

ntgyqrqlnykhydgsystfgerygrnqgntwltaf^ 

ggvedevtlsayitialleipltvthpwrnalfclesa^^ 
25 kdnsvhwerpqkpkapvghfyepqapsaevemtsyvllayl 

lskygmtftrtgkaaqvtiqssgtfsskfqvdnnnrlllqqv^ 

VQTLPQTCDEPKAHTSFQISLSVSYTGSRSASMAIVDVKMVSGFIPLKPTVKMLERSWHVSRTEV 
LFFTVLQDVPVRDLKPAIVKVYDYYETDEFAIAEYNAPCSKDLGNA 

407 

30 MKLOmfLSLLMLVAAFCSPALSAPMGSDPmCCFSYTARKLPRNFVVDYYETSSLCSQPAW 
YVYDLELN 

408 

• AGCACAGGACACAGCTGGGTTCTGMGCTTCTGAGTTCTGCAGCCTCACCTCTGAGAAMCCTCTTT 
GCTCTGCGTGACTGTCCTGTCTCTCCTCATGCTAGTAGCTGCCTTCTGCTCTCCAGCGCTC 
35 TCCCACCGCCTGCTGCTTTTCTTACACCGCGAGGMGCTTCCTCGCMCTTTGT^ 

CTCCCAGCCAGCTGTGGTATTCCAAACCAAMGMGCMGCMGTCTGTGCTGATCCCAGTC 
GTATGACCTGGMCTGMCTGAGCTGCTCAGAGACAGGMGTCTTCAGGGMGGTCACCTGAGCCCGGATC 
CACATCTCCTCCATACTCAGGACTCCTCTCCGCAGTTCCTGTCCCTrcTC 
. ATTAGGTGTCATTTCCATTATTTATATTAGTTO 
40 TGCTGTTGCAAATACATGGATMCACATTTGATTC 

409 

MESRGPLATSRLLLLLLLLLLRHTRQGWALRPVLPTQSAHDPPAVHLSNGPGQEPIAVMTFDLTKITKTSSSFE 
FYGDTNPKDDWFMLGLRDGRPEIQLHNW^ 
RIALGGLLFPASNLRLPLVPALDGCLRRDSWLDKQAEI^ 
45 DLGLKQMGSGHLLAliGTPENPSWLSLHLQDQKVVLSSGSGPGLDLPLVLGLPLQLKLSMSRVVLSQGSK^ 
LNLWAKPQGRLFLGALPGEDSSTSFCLNGLWAQGQRLDVDQALNRSHEI WTH SC PQS PGNGTDASH 

410 



WO 02/46465 



PCT/GBOJ/05458 



402 



AT«3AGAGCAGA(^CCACTGGCTACCTCGCGCCTGCTGCTGm 

GCCCTGAGACCTGTTCTCCCCACCCAGAGTGCCCACGACCCTCCGGCTGTCCACCTCAGCMTGGCCCAGGACAAGAGCCTATC 

GCTGTCATGACCTTTGACCTCACCMGATCACAAAMCCTCCTCCTCCTTTGAGGTTCGM 

TTTTATGGGGATACCMCCCTAAGGATGACTGGTTTATGCTGGGACTTCGAGACGGCAGGCCTC 
5 TGGGCCCAGCTTACGGTGGGTGCTGGACCACGGCTGGATGATGGGAGATGGCACCAGGTGGMGTCMG 

GTGCTGCTGGAGGTGGATGGGGAGGAGGTGCTGCGCCTGAGACAGGTCTCTGGGCCCCTGA 

AGGATTGCGCTTGGGGGGCTGCTCTTCCCCGCTTCCMCCTTCGGTTG 

GATTCCTGGCTGGACAMCAGGCCGAGATCTCAGCATCTGCCCCCACTAGCCTCAGAAGCTGTC 

ATATTTCTCCCTCCAGGGACTCAGGCAGMTTCAATC^ 
10 GACCTGGGACTCMGCAGGCAGCAGGCTCAGGCCACCTCCTTGCTCTTGGGACACCAGAGM 

CTCCAAGATCAAAAGGTGGTGTTGTCTTCTGGGTCGGGGCCAGGGCTGGATCTGCCCCTGGT^ 

MGCTGAGTATGTCCAGGGTGGTCTTGAGCCAAGGGTCGMGATGMGGCCCTTGCCCTGCCTCCCTTA 

CTTMCCTCTGGGCCMGCCTCMGGGCGTCTCTTCCTGGGGG 

GGCCTTTGGGCACMGGTCAGAGGCTGGATGTGGACCAGGCCCTGMCAGMG 
15 AGCCCAGGCAATGGCACTGACGCTTCCCATTAA. 

411 

MADKLTRIAIVNHDKCKPKKCRQECKKSCPVVRMGKLCIEVTPQSKIAWISETLCIGCGICIKKCPFGALSIVN 
HRYCANAFKLHRLPIPRPGEVLGLVGTNGIGKSAALKILAGKQKPNLGKYDDPPDWQEILTYFRGSELQNYFTKILEDDLKAII 
KPQYVARFLRLAKGTVG S I LDRKDETKTQAI VCQQLDLTHLKERNVEDL SGGELQRFACAWCI QKADIFMFDEPS S YLDVKQR 
20 LKAAITIRSLINPDRYIIWEHDLSVLDYLSDFICCLYGVPSAYGWTM 
ANEEEVKKMCMYKYPGMKKMGEFELAIVAGEFTDSEIMVMI^ 

TGSWQLLHEKIRDAYTHPQFVTDVMPLQIENIIDQEVQTLSGGELQRVRLRLCLGKPADVYLIDEPSAYLDSEQRLM 

KRFILHMKTAFWEHDFIMTYLADRVIVFDGV^^ 

KKSGNYFFLDD 

25 412 

CCGGTCCTGAGACACGCTGTGTGGCTGAAMGTGMGGCMGAGCTCA 

CTCCAGMGAGCTGGATATTCTTTCGCCCAGTTAT^ 

CCTMGAMTGTCGACAGGMTGCAAAMGAGTTGTCCTC^ 

MMTAGCATGGATTTCCGAMCTCTTTGTATTGGTTGTGGTATCTGTATTMGAM 
30 AATCTACCMGCAACTTGGAAAMGAAACCACACATCGATATTGTGCCMTGCCTTCAMCTTCAC 

CCAGGTGMGTTTTGGGATTAGTTGGMCTMTGGT^ 

MCCTTGGAMGTACGATGATCCTCCTGACTGGCAGGAGATTTTGACTTA^ 

MGATTCTAGAAGATGACCTAAMGCCATCATCAAACCTCMTATGTAGCCAGATTCC^ 

TCTATTTTGGACCGAAMGATGAMCAAAGACACAGGCMTTGTATGTCAGCAGCTTGATTTM 
35 GTTGMGATCTTTCAGGAGGAGAGTTGCAGAGATTTGCTTGT 

GAGCCTTCTAGTTACCTAGATGTCMGCAGCGTTTAAAGGCTGCTATTACTATACGATCTCTMTAM 

ATTGTGGTGGMCATGATCTMGTGTATTAGACTATCTCTCCGACTTCATCTGCTC 

GTTGTCAOTATGCCTTTTAGTGTMGAGMGGCATAMCATTTTTT^ 

GATGCATCACmTTTTTAMGTGGCTGAGACAGCAMTC 
40 MGMAAAMTGGGAGAATTTGAGCTAGCAATTGTAGCTGGAGAGTTTACAG 

GGMCGGGTAAMCGACATTTATCAGMTGCTTGCTGGM 

AGmTMGCCACAGAAMTTAGTCCCAMTCMCTGGMGTGTTCGCCAGTTACTACATGAAAAGATM 
CACCCACMTmTGACCGATGTMTGMGCCTCTGCAAATTGAAMCATCA 
GMCTACAGCGAGTACGTTTACGCCTTTGCTTGGGCAMCCTC 
45 TCTGAGCAMGACTGATGGCAGCTCGAGTTGTCAMCGTTTCATACTCCATGCAAAAMG 
TTCATCATGGCCACCTATCTAGCGGATCGCGTCATCGTTTTTGATGGTGTTCCATC 
ACCCTTTTGGCTGGCATGMTAAATTTTTGTCTCA 

AACAMCTTMTTCMTTMGGATGTAGMCAAMGMGAGTGGAMCTACTTTTTC^ 
ATTGATMGCCATTTATTAAMGGAGTATTTACTAGMTTTTTTC 
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403 

TCTGGMCTTGMGTATMTATACTTAATATMCATAAAAAGCC^ 

CTTTTCTGTTCCTGMGAGGCTCTTTTGTGCATAATATTCTAA 

CATGATGTATGGGMGATTTTCAGTAGGTGTATTATATTCACGGTACC 

GAAMGGGTTTCTGTACTTACCTGGTTTGCC^ 
5 CACTGATTGACATTTGATAMTAAACATCAGGATTATGTTTATTGTTTC 

TTCCGAGGMGATTATCTACTGCAAMCACCACTGTTGGAAAMTAGGTATTTTTAMT 

TTAMCATGTTTMGCAAAMCCMTTCAGTCCATTCCCCGCAAAAM 

TTCCATGAGGTTCTGTATTCAGTCATTCTCTAGGTAATGTCATTTT^ 

GG AAAMGCATTTCTAAAGMTATTTGCTTCCCTTAGMCTAC 
10 TTTTTTTTMGTTCCACAGATTTTTC 

CCCCAGATGGCMTACAMGTATCCCCTGGTACCACATATATTCATTOTGAGTTTGGATATA^ 

TTGTAGTTCCAAAMCCCATCTAMTTTCTTGAGTTCCTGMTTTTGMCA 

TACTTCTGACTAMCTGGMTTATGAGTGAGGAAGAGTGTTTACT 

AGAAACTGTTTGACAMCTTTAAGAGCTACTTGAMTMCAGAAGTCTTGAT^ 
1 5 TAACTGGACCCTATTATGCCTTTTAAAAA^ 

ATAAAAAMTGATCTCCCTTTATTACCCTCCCAMGG 

TGTGCACmCCTMGTGTGMTATGTAAAGGGTTTGT 

MGGATAGGTTAAATTTGTGMTGATCATTTCAA^^ 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

20 413 

MAAVKTLNPKAEVARAQAALAVNI S AARGLQDVLRTNLG PKGTMKMLVSGAGDIKLTKDGNVLLHEMQIQHPTASLLAKVATAQ 
DDITGDGTTSNVLI IGELLKQADLYI SEGLHPRI ITEGFEAAKEKALQFLEEVKVSREMDRETLIDVARTSLRTKVHAELADVL 
TEAWDSIMIKKQDEPIDLFMIEIMEMKHKSET^ 
AEEREKLVKAERKFIEDRVKKIIELKRKVCGDSDKGFWINQK^ 
25 DDLSPDCLGHAGLVYEYTLGEEKFTFIEKCNNPRSVTLLIK^ 
ALIKHKPSVKGRAQLGVQAFADALLIIPKVLAQNSGFD^ 
LHSCTVIATNILLVDEIMRAGMSSLKG 

414 

CGAGMGACCCGGATAGTTCCTCCCGGCCACGCCGCGCCGGCTCTGGGCACTCM 
30 CTATGGCGGCGGTGAAGACCCTGAACCCCAAGGCCGAGGTGGCCCGAGCGCAGGCGGCGCTGGCGGTCAACATCAGCGCAGCGC 

GGGGTCTGCAGGACGTGCTMGGACCMCCTGGGGCCCMGGGCACCATGMGATGCTCGTTTCTGGCG^ 

TTACTAMGACGGCMTGTGCTGCTTCACGAMTGCAAATO 

AGGATGATATMCTGGTGATGGTACGACTTCTMTGTC^ 

MGGCCTTCATCCTAGMTAATCACTGAAGGATTTGMGCT^ 
35 GCAGAGAGATGGACAGGGAAACACTTATAGATGTGGCCAGMCATCTCT^ 

TMCAGAGGCTGTAGTGGACTCCATTTTGGCC^ 

TGAMCATAAATCTGAMCTGATACAAGCTTMTCAGAGGGCTTGTTTTGGACCACG^ 
' GGGTGGAGGATGCATACATCCTCACTTGTMCGTGTCATTAGAGTATGAGAAMCAGAAGTC 
GTGCAGAAGAGAGAGAAAMCTCGTGAMGCTGAMGAAMTTCATO 
40 MGTCTGTGGCGATTCAGATAMGGATTTGTTGTTATTM 

MGGCATAGTCGCTCTGCGCAGAGCTAAMGGAGAMTATGGAGAGGCTGACTCTTGCTTGTGGT^ 
TTGACGACCTMGTCCTGACTGCTTGGGACATGCAGGACTTGTATATGAGTATACATTC 

AGAMTGTMCAACCCTCGTTCTGTCACATTATTGATCAMGGACCAAATMGCACACACTCACTC ■ 
GGGACGGCTTGAGGGCTGTCAAAMTGCTATTGATGATGGCTG * 
45 MGCCCTGATTAMCATAAGCCCAGTGTAMGGGCAGGGCACAGCTTGGAGTCCMGCATTTGCTC^ 
CCMGGTTCTTGCTCAGMCTCTGGTTTTGACCTTC^ 
TTGTGGGTGTGGACCTGMCACAGGTGAGCCMTGGTGG^ 

TTCTTCACTCCTGCACTGTGATTGCCACCMCATTCTCTTGGTTGATGAGATCATGCGAGC 
GMTTGMGCTTCCTCTGTATCTGMTCTTGMGACTGCAMGTGATCCTC 
50 CAMTMTTTTGAMGAMTTTTCCCATATGAAAAMGGAGAGMCACTGGC 
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GTATTTTTAAAMTTGCACTGAAG 

GACATAAMTTCCATGTTAGATMGCATATGTTACTTACCTTC 
415 

MTAPGMGRCPPTTWLGSLLLLVCLLASRSITEEVSEYC^ 
5 AFLLVQDIMEDTMRFRDNTANPIAITVQLQELSLRL^ 
KNCNNSFABCSSQDWTKPD^ 

PPRSTCQSFEPPETPWKDSTIGGSPQPRPSVGAFNPGMEDILDS 

PARPSNFLSASSPLPASAKGQQPADVTATALPRVGPVMPTGQDWNHTPQKTDHPSALLRDPPEPGSPRISSLRPQALSNPSTLS 
AQPQLSRSHSSGSVLPLGELEGRRSTRDRTSPAEPEAAPASEGAARPLPRFNSVPLTDTGHERQSEGSSSPQLQESVFHLLVPS 
1 0 VILVLLAVGGLLFYRWRRRSHQEPQRADSPLEQPEGSPLTQDDRQVELPV 

416 

CCTGGGTCCTCTCGGCGCCAGAGCCGCTCTCCGCATCCCAGGACAGCGGTGCGGCCCTCGGCCGGGGCGCC 
CCAGCGAGCCAGCTGCCCCGTATGACCGCGCCGGGCGCC^ 

TTGGTCTGTCTCCTGGCGAGCAGGAGTATCACCGAGGAGGTC ' 

15 TCTCTGCAGCGGCTGATTGACAGTCAGATGGAGACCTCGTGCCAM 

CCAGTGTGCTACCTTAAGAAGGCATTTCTCCTGGTACAAGACATAATGGAGGACACCATGCGCTTCAGAGATAACACCGCCAAT 

CCCATCGCCATTGTGCAGCTGCAGGAACTCTCTTO 

TGCGTCCGMCTTTCTATGAGACACCTCTCCAGTO^ 

MGGACTGGAATATTTTCAGCMGMCTGCMCMCAGCTTTGCTGM 

20 MCTGCCTGTACCCCAMGCCATCCCTAGCAGTGACCCGGCCOT 

GTGGCTGGCTTGACCTGGGAGGACTCTGAGGGMCTGAGGGCAGCTCCCTCTTGCCTGGTGAGCAGCCCCTGCACACAGTGGAT 
CCAGGCAGTGCCMGCAGCGGCCACCCAGGAGCACCTGCCAGAGCTTTGAGCCGCCAGAGACCCCAGTTGTCAAGGACAGCACC 
ATCGGTGGCTCACCACAGCCTCGCCCCTCTGTCGGGGCCTTCAACCCCGGGATGGAGGATATTCTTC 
MTTGGGTCCCAGMGMGCCTCTGGAGAGGCCAGTGAGATTCCCGTACCCCMGGGACAGAGCTTTCCCCCTCCAGGCCAGGA 

25 GGGGGCAGCATGCAGACAGAGCCCGCCAGACCCAGCAACTTCCTCTCAGCATCTTOTCCACTCCCTGCATCAG 

CAGCCGGCAGATGTAACTGCTACAGCCTTGCCCAGGGTGGGCCCCGTGATGCCCACTGGCCAGGACTGGAATCACACCCCC 
AAGACAGACCATCCATCTGCCCTGCTCAGAGACCCCCCGGAGCCAGGCTCTCCCAGGATCTCATCACTGCGCCCCCAGGCCCTC 
AGCMCCCCTCCACCCTCTCTGCTCAGCCACAGCTTTCCAGMGCCACTCCTCGGGCAGCGTGCTGCCCCTTGGGGA 
GGCAGGAGGAGCACCAGGGATCGGACGAGCCCCGCAGAGCCAGAAGCAGCACCAGCAAGTGAAGGGGCAGCCAGGCCCCTGCCC 

30 CGTTTTAACTCCGTTCCTTTGACTGACACAGGCCATGAGAGGCAGTC 
TTCCACCTGCTGGTGCCCAGTGTCATCCTGGTCTTGCTG 

CMGAGCCTCAGAGAGCGGATTCTCCCTTGGAGCAACCAGAGGGCAGCCCCCTGACTCAGGATGACAGACAGGTGGAACTGCCA 
GTGTAGAGGGMTTCTMGCTGGACGeACAGMCAGTCTCTTCGTGGGAGGAGACATTATGGGGC 
TGGCCATCCTCCTOGAATGTGGTCTGCCCTCCACCAGAGCTCCTGCCT^ 
35 TCTGTCTCAACCCGCAGACCCTTGACTGAATGAGAGAGGCCAGAGGATGCTCCCCATGCTGCCACTATTO 
AGGCTCCCATGTGCTTGAGGMGGCTGGTGAGCCCGGCTCAGGACCCTCTTCCC 
CCATGCCGGMCCCAGGCCAGGGACCCACCGGCCTGTGGTTTC^ 
ACCCAGAGGGCCTGCCTGGTGCCMGGTATCCCAGCCTGGACAGGCATGGACC 

GGGCGGGACAGCCTCGGCCTGATTTCCCGTAMGGTGTGCAGCCTGAGAGACGGGMGAGGAGGCCTCTGCACCTGCTGGTCTC 
40 CACTGACAGCCTGAAGGGTCTACACCCTCGGCTCACCTMGTCCCTGTGCTGGTTGCCAGGCCCAGAGGGGAG^ 

CCTCAGGACCTGCCTGACCTGCCAGTGATGCCMGAGGGGGATCAAGCACTGGCCTCTGCCCCTCCTCCTT^ 

GAGCTTCTCCAGCAGGCCMGCAGAGGCTCCCCTCATGMGGMGCCATTGCACTGTGMCACTGTAC 

CTCCCCCCGTCCATCCATGAGCCAGCATCCGTCCGTCCTCCACTCTCCAGCCTCTCCCCAGCCTCCTG^ 

CCAGTCGACTGAGGGAGCCCCTCAGCCCTGACCTTCTC 
45 TCCTGGGCCGCCAGCCAGAGCCGCTCTTTAGGCTGTGTTCTTCGCCCAGGTTTCTGCA 

GMGGGACTAGTGGGAGAGAGCMGGGAGGGGAGGGCACAGACAGAGAGCCTACAGGGCGAGC 

GAMTATAGGTATGCACCTGAGGTTGGGGGAGGGTCTGCACTCCCAMCC^ 

ACCTGGGGCTGAGCAGGTTATCCCTGTCAGGAGCCC^ 

CCACTTCTCACCCTTCTTTCCTCCTGACCTO 
50 ACCAGGCMGCCAGGGTGGGAGAGCMTCAGGAGAGCCAGGCCTCAGOT^ 
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CTGTGGTGCTGGCTCTGAGGCCTAGGCM 

TCTCCTGCCGCCCTTTGTCCTCCGCTMGAGACCCTGCCCTACCTGGCCGCTGGG 
AGTGAGGGTCGGCTGGCCCCACCTTCCCTGTCCTGATGCCGACAGCTTAGGGAAGGGCACTGM 
TTCTAGTCACAGCCTCTATATTTGATGCTAGAAMCACATATTTTTAM 
5 CCCTACCTTAMCATATAATATTTTAMGGTCAAAAMGCAATCCAA 
CAGAGCAGGAGGAGCCCCAGAGCCACCTCTGGTGTCCra 
AGAGGACATTGGCTCACGCACTGTGAGATTTTGTTTTW 
TATTTAAMGATAGGAAGCTGCTTATATATTTMTAATAAMGMGTC 

417 

10 MSVPAFIDI SEEDQAAELRAYLKSKGAEI SEENSEGGLHVDLAQI I EACDVCLKEDDKDVESWNSWSLLLI LEPDKQEALIE 
SLCEKLWFREGERPSLRLQLLSNLFHGMDKOTPWYTVYCSLIEWASCGAIQYIPTELDQVRKW 
YEALADCKKSDMSKVMVELLGSYTEDNASQARVDAHRCIVEPLKDPNAFLFDHLLTLKPVKFLEGELIHD 
VKFYQNNKDFIDSLGLLHEQNMAW1RLLTFMGMAIENKEISFDTMQQELQIGADDVEAFVID 
STHRTFGKQRWQQLYDTLNAWKQNLNKVKNSLLSLSDT 

15 418 

GGGTCGGCGTGGTCTTGCGAGTGGAGTGTCCGCTGTGCCCGGGCCTGCACCATGAGCGTCCCGGCCTTC 

GMGATCAGGCTGCTGAGCTTCGTGCTTATCTGAMTCTAAAGGAGCTGAGATTTCAGM 

GTTGATTTAGCTCAAATTATTGMGCCTGTGATGTGTGTC 

GTATCCCTACTCTTGATCCTGGMCCAGACMGCMGMGCTTTGATTGAAAGCCTATGTGAAMG 
20 GGTGMCGCCCGTCTCTGAGACTGCAGTTGTTMGCMCCTTTTCCACGGGATGGATMGMTACTCCTGTMGATACACAGTC 
TATTGCAGCCTTATTGMGTGGTAGCATCTTGTGGGGCC^ 

TCTGACTGGMTCTCACCACTGAAAAAAAGCACACCCTTTTMGACTACTTTATGAGGCACTTGC 
GCTGCTTCAAMGTCATGGTGGMTTGCTCGGMGTTACACAGAGGACMTGCTTCCCAGGCTCGAG 
ATTGTCGMCCATTGAMGATCCAMTGCATTTCTTTTTGACCAGCTTCTO 
25 CTTATTCATGATCTTTTAACCATTTTTGTGAGTGCTAMTTGGCA 
GATTCACTTC&CCTGTTACATGMCAGMTATGGCAAAM 

ATTTCTTTTGACACAATGCAGCMGMCTTCAGATTGGAGCTGATGATGTTGMGCA 

ATGGTCTACTGCAAAATTGATCAGACCCAGAGAAMGTAGTTGTCAGTCATAGCACACATCGGACATTTC 

CAACMCTGTATGACACACTTMTGCCTGGAMCAAMTC 

419 ' 

MSANATLKPLCPILEQMSRLQSHSNTSIRYIDHMVLLHGLASL 
PFFTYFLAVGHSWELGTTFCKLHSSIFFLIWASGFL^ 

FRDTI SRLDGRIMC YYNVLLLNPGPDRDATCNSRQAALAVSKFLLAFLVPLAI I AS S HAAVSLRLQHRGRRRPGRF VRLVAAW 
35 AAFALCWGPYHWSLLEARAHMPGLRPLVWRGLPFVTSLAFFNSVMPVLYVLTCPDMLm 
GSSRRRRTSSTARSASPLALCSRPEEPRGPARLLGWIiLGSCAASPQTGPLNRALSSTSS 

420 

CAGCCTCCCTCTCCCACCTCTGTCTGCCCGCTGCCTCTTGTCTAGCTGCTC 

GTGCTCCCTCTGTGCCCAGAGCCCCACGATGTCGGCCMCGCCACACTGMGCCACTCTGCCCCATCCTGGAGCAGATOAGCCG 
40 TCTCCAGAGCCACAGCMCACCAGCATCCGCTACATCGACCACGCGGCCGTGCTGCTGCACGGGCTGGC 

GGTGGAGAATGGAGTCATCCTCTTCGTGGTGGGCTGCCGCATGCGCCAGACCGTGGTCACCACCTC 

GTCCGACCTGTTGGCCTCTGCTTCCCTGCCCTTCTTCACCTACTK:TTC 

CTGCAMCTGCACTCCTCCATCTTCTTTCTCMCATGTTCGCCAGCGGCTTCCTGCTCAG 

GCAGGTGGTGCGGCCGGTGTGGGCGCAGAACCACCGCACCGTGGCCGCGGCGCACAAAGTCTGCCTGGTGC 
45 GGTGCTCMCACGGTGCCCTATTTCGTGrTCCGGG^ 

GCTCCTGMCCCGGGGCCTGACCGCGATGCCACGTGCAACTCGCGCCAGGCGGCC 

CCTGGTGCCGCTGGCGATCATCGCCTCGAGCCACGCGGCCGTGAGCCTGCGGTTGCAGCACCGCGGCCGC 

CTTCGTGCGCCTGGTGGCAGCCGTCGTGGCCGCCTTCGCGCTC 
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GGCGCACGCAMCCCGGGGCTGCGGCCGCTCGTG^ 
CMCCCGGTGCTCTACGTGCTCACCTGCCCCGACA^ 

GGTGGACGACAGCGAGCTGGGTGGCGCGGGMGCAGCCGCCGCCGCCGCACCTCCTCCACCGCCCGCTCGGCCTCCCCTT 
TCTCTGCAGCCGCCCGGAGGMCCGCGGGGCCCCGCGCGTCTCCTCGGCTGGCTGCTGGGCAGC 
5 GGGCCCCCTGMCCGGGCGCTGAGCAGCACCTCGAGTTAGAACCCGGCCCACGTAGGGCGGCACTCACACGCGAAAGTATCAC 
AGGGTGCCGCGGTTCMTTCGATATCCGGACTCCTGCCG 

MGCGCCCCGGGAGACTCTGMTCTTTTTCAGAMCAGTGAGl^AMGCAGTGCTTCTC 
CTAGGGGTCTTGTTAAGTGCAGTCTGATCCAGGAGGCCGGGGCCGGG^ 

MGCCAGTGCGGCAGGTTCACAGGCGAGGCCTGGAGTMCACAMGTGAMCTCGTMTAGACTTCCCACTCTAGGGCAGTGGA 
10 GTCGGMGGGCACACGGGGTGCGTCTCCCCGGAGTTCAGTTTTACCAGATGATGGGGGAGGGGGGM 
CATCCATGTATTTTTGGAGAAGAGAGAGGAMGGTTTGAGMGCACTGTTCCAGCCTGCCCTC 
GCGCTAGACGCTTCATCCCACMTCTTAAGGGGCAGCTTCTATTAGCCAGTCTOT 
TTMGTGACTTGCCCAGTTTCAGGGCTMCGACCACAGGGTCTGCACTCTMCCCTAGGCATCAC 
TGAGCGAGGACATTCTCTGACCTACTCGAGGGACTTMGATGCTACCTTGTGACCCAGCACTGCCCAMGTC 
15 • MGCAGCAGGGGATGGCGTGGTCMGCACTCGGGAMCCTGGG 

TCGTTAGMGTTGAATGGGCACAGCMCTCTMGACTACAGCACACGTCATTTCTTAGCTAAG 

CTGGTGTTCTGTGGGATCCCTCTGGGCACTGGTMTCCCMGATCTGTGCAGCCCCGCCTCCAGGC^ 

TACCATTTCCCTTTTGCGGATGGGAGGGGTMCT 

GCCGTGGACTTGGGGTCAGAGACTGCTGTGTTTGAGCTCTGCAGCCCAGGGACCGAAMGTTGGTGTC 
20 TGGATGAAATGTCAGTGGMGMGCAGATGAGAMCTCTTGAGATCTTGGTCCTGTC 

ACTGMGGCCTGGCCCACAGCAGGTGCTGAGCAMGGGMCAGTGAGGTGCCCAGCTAGCTGCAGAGCCACCCTC 
CTCGCCCCTGCTCCCTCCCATCCCTTCCCCCTTTACTCATAGCACTTCCCCCATO 
ATGTTTTCTCTCCATCAGMTGAMGCTC^ 
CCTGTATGCMCAGGCACTCMTAMTATTTTTGCTGTAGACTGG 

25 421 

MASTSTTIRSHSSSRRGFSASSARLPGVSRSGFSSISVSRSRGSGGLGGACGGAGFGSRSLYGLGGSKRISIGGGSCAISGGYG 
SRAGG S YGFGGAG SGFGFGGGAGIGFGLGGGAGLAGGFGGPGFPVC PPGGI QEVTVNQSLLTPLNLQIDPAI QRVRAEEREQI K 
TLNNKFASFIDKTOFLEQQNKVLDTKW^ 

YEDEINKRTAAEWEFVTLKKDVDAA YMKVELQAKADTLTDEINFLRAL YDAEL SQMQTHI SDTS WLSMDNNRNLDLDSI I AE 
30 VKAQYEEIAQRSRAEAESWYQTKYEELQVTAGRHGDDLRNTKQEIAEINRMIQRLRSEIDHVKKQCmQMIADAEQRGE^ 
KDAKNKLEGLEDALQKMQDLMLLKEYQELMNVKIiALDVEIATYRKLLEGEECRLNGEGVGQVNISW 
SGLGLGGGSSYSYGSGLGVGGGFSSSSGRATGGGLSSVGGGSSTIKYTTTSSSSRKSYKH 

422 

■ ATGGCCAGCACATCCACCACCATCAGGAGCCACAGCAGCAGCCG 
35 AGCCGCTCTCGCTTCAGCAGCATCTCCGTGTCCCGCTCCAGGGGCAGTGGTC 

GGCAGCCGCAGTCTGTATGGCCTGGGGGGCTCCMGAGGATCTCCATTGGAGGGGGCAGCTGTC 

AGCAGAGCCGGAGGCAGCTATGGCTTTGGTGGCGCCGGGAGTGGATTTGGTT^ 

GGTGGTGGAGCCGGCCTTGCTGGTGGCTTTGGGGGCCCTGGCTTCCCTGTGTGCCCCCCTGGAG 

MCCAGAGTCTCCTGACTCCCCTCMCCTGCAMTTGACCCCGCCATCCAGCGGGTGCGGGCCGAGGAGCGTGAGCAGATGAAG 
40 ACCCTCAACMCMGTTTGCCTCCTTCATCGACMGGTGCGGTTCCTAGAGCAGCAGMCAAGGTTCTG^ 

CTGCTGCAGGAGCAGGGCACCMGACTGTGAGGCAGAACCTGGAGCCGTTGTTCGAGCAGTACATCM 

CTGGACAGCATCGTGGGGGAACGGGGTCGTCTGGACTCGGAGCTGAGAMCATGCAGGAC^ 

TATGAGGATCAMTCAACAAGCGCACAGCAGCAGAGMTGMTT^ 

MGGTTGMCTGCAAGCCMGGCAGACACTCTOCAGATGAGA 
45 ATGCAGACCCACATCTCAGACACATCCGTGGTGCTATCCATC^ 

GTCMGGCCCMTATGAGGAGATTGCTCAGAGGAGCAGGGCTGAGGCTGAGTCCTC^ 

GTCACAGCAGGCAGACATGGGGACGACCTGCGCMCACCMGCAGGAGATOCTGAGATCMCCGCATGATCCAG 
TCTGAGATCGACCACGTCMGMGCAGTGTGCCMCCTACAGGCTG 
MGGATGCTMGMCMGCTGGMGGGCTGGAGGATGCCCTGCAGMGGCCMGCAGGACC 
50 CAGGAGCTGATGMCGTCMGCTGGCCCTGGACGTGGAGATCGC^ 
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MTGGCGMGGCGTTGGACMGTCAACATCTCTGTAGTGCAGTCCACCGT 
AGTGGCTTAGGCCTGGGTGGAGGMGCAGCTACTCCTATGGCAGT^ 

AGAGCCACTGGGGGTGGCCTCAGCTCTGTTGGAGGCGGCAGTTCCACCATCMGTACACCACCACCTCCTCCTCCAGC 
AGCTACAAGCACTGA 

5 423 

MNGPALQPSS PS S APSA S PAAAPRGWSEFCELHAVAAARELARQYWLFAREH PQHAPLRAELVSLQPTDLFQRYFCREVRDGRA 
PGRDYROT^RGPPAKAEASPEPGPGPMPGLPKARSSEELAPPRPPGPCSFQHFRRSLRHIFRRRSAGELPAM 
METPARPGIJMCKFLPWSIAREPPPEALKEAVLRYSLADEA 
ACSSIQEVRWCTRLEMPDNLYTFVLKVKDRTDIIFEVGD^ 
10 SLNQGASPGGLLDPACQKTDHFLSCYPV^HGPISRVKAAQLVQL 
RGQCRVQHLHFPSWDMLHHFQRSPIPLECGMCDVRLSSYVVW 
• GCPRGLSPEGLPGRSSPPEQIFHLVPSPEELANSLQHLEHEPVNRARDSDYEMDSSSRSHLRAIDNQYTPL 

424 

CCCGGGCCACCGCCTCCGCCCGGCTGCCCGCCCGGACTGTCGCGGCCCGCGGTGGCGACGGCGGCCGCTGCA^ 
15 CGGCGGCGGCCCGGGGGCGCATCCTCCCGCMCTGTCMGCGCTC^ 

GGGTTTCCTGCCTGAGCCCCGCTCGAGCGAGCCGCGAGCGAGGAGCCGGCGGGCGGGAGAGGACGCGCCCAGGGCGGGGGCCCG 

CCCGCCCCCTCGGGATTTCGAGGGCCCGGGGGCGCGCGACGCCATGGGCCGGCCGGGCCCAGAGCTCCTGTCTCTCAGCCCGGC 

CGCACCACCTGGGTC.TCCGCCATGMCGGGCCTGCCCTGCAGCCCTCCTCGCCCTCTTCCGCGCCCTCAGCCTCCCCGGCGGC^ 

GCCCCGCGGGGCTGGAGCGAGTTCTGTGAGTTGCACGCCGTAGCGGCGGCC^ 
20 CGGGAGCATCCGCAGCACGCGCCGCTGCGCGCCGAGCTGGTGTCGCTGCAGTTCACCGACCTCTTCCAGCGCTACTTC 

GAGGTGCGCGACGGACGGGCGCCGGGCCGCGACTACCGGGACACAGGCCGTGGGCCCCCAGCCAAGGCCGAGGCGTCCCCGGAG 

CCAGGCCCCGGCCCCGCCGCCCCTGGCCTGCCCAAGGCCCGCAGCTCTGAGGAGCTGGCCCCGCCGCGGCCGCCCGGGCCCTGC 

TCCTTCCAGCACTTTCGCCGCAGCCTCCGCCACATCTTCC 

GCCCCCGGGACC.CCCGGAGAGGCTGCTGAGACCCCCGCCCGGCCTGGCCTGGCC 
25 GAGCCGCCACCCGAGGCGCTGAAGGAGGCGGTGCTGCGCTACAGCCTGGCCGACGAGGCCTCCATGGACAGCGGGGCACGCTGG 

CAGCGCGGGAGGCTGGCGCTGCGCCGGGCCCCGGGCCCCGATGGCCCCGACCGCGTGCTGGAGCTCTTCGACCCACCCAAGAGT 

tcmggcccmgctacaagcagcttgctcca«:atccaggaggtccggtggtgcacacggcttgagatg^ 
acctttgtgctgaaggtgmggaccggacagacatcatctttgaggtgggagacgagcag 
ctctcggagtgcacaggccgagggctggagagcacagaagcagagatgcatattccctcagccctagagcctagcacgtccagc 
30 tccccmggggcagcacagattcccrtaaccaaggtgcttctcctggggggctgctggacccg 

- TTCCTGTCCTGCTACCCCTGGTTCCACGGCCCCATCTCCAGAGTGA 

CATGGAGTGTTCCTGGTGCGGCAGAGCGAGACGCGGCGTGGGGAATACGTGCTCACTTTCMCTOT 

CTGCGCCTGTCGCTGACAGAGCGGGGCCAGTGCCGTGTGCAGCACCTCCACTTTCCCTCGGTCGTGGACATGCTC 

CAGCGCTCGCCCATCCCACTCGAGTGCGGCGCCGCCTGTGATGTCCGGCTCTCCAGCTACGTGGTAGTCGTCTCCCM 

35 GGTTCCTGCMCACGGTCCTCTTCCCTTTCTCCCT^ 

CTTCCCCACCTTAGTTCTTCTGGCTGTCCCCGGGGGCTCAGCCCAGAGGGTCTCCCAGGGCGATC 
TTCCACCTGGTGCCTTCGCCCGMGMCTGGCCMCAGCCTGCAGCACCTGGAGCATGAGCCTGTG 
GACTACGAMTGGACTCATCCTCCCGGAGCCACCTGCGGGCCATAGACMTCAGTACACACCTCTCTGACCA^ 
AGGCCTCMCAGCTGCCCTTGAGGAGCACAGGCAGMGTGTGMOT 

40 ATTTMGGGACACTGTTAACTGCTCGTGCCAGTTTGGM 

ATCTATCCACCTGGCTTTCTCCTTATOTTTACAGATGTAGTTCTTGTTAGAGGATGCCGCTAGC 
GCCCAGTCCCCGTTACTCTTAGAGAMGGAGTTGGGGTGAGGGCCAGAGCTGGCAGTGGAMCTTGTTC 
CTGTCACAGCGGATGACAGACTTTCTACGGGGAGGAGGGGGGGATCATCAGGMG 
CTACCGTCAGTCCACATGGCAGGTCTGCTGTGTC^ 

45 CAAMGCTCTGTCAGAGGCACMGCTGMGGTCAAAMTGATTTAAAACATTTTACCTCAG 
GTTCAAMCTTAACCACTGCTTATTTCAGTGCACTGTTTCM 
ACCCTACCTMTTTACAGTCTGAGCCAGCATGCTGGCTTGTCTACTG 

GTCCTTCCTAAATGTCMGMGTGMGTATGTCACCCTTTCAGGGAAATTC ' 
AGTTCTATCCTGGTGCCTTACGMTAAAAMCTCGATTCTGGTTTACA 
50 GGGGAAMGCAACCAAAMGGGAAAMGGACTCCTAGGCCCTTTCTATTAMTCC™ 
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CMGCATCTMTGGCTTATTAAATTATC 

GCCATAAeTMTGAGAMCTCCTTTMTACCCCACMTCAGTGTTCTC 
ATCTATCTCCTTGCACACATGGGCACACACAATCTCC^ 
ACMTTCATAGGGGGCTGGCTCCTCCCAGAACCTGTCTGGAGGCTC^ 
5 GGTGCCAGCAGAGCCATTCAGTACMCCCCCAGGCTCACAGCAGTGGCTTCTAGGAMCTGGGAGm 
TACATCGATCAGAGGCTAAMTGAAACCTCAGCCTAAAACTCATA^ 
CACTTATACTTAGTCTCTGTGCTCCMGAGGTCAMTTTTTGCTTC 
MCCCCTATGCACTTTTCTTTTTTTTTT^ 

CACTGCMCCTCCACCTTCCTGGTTCMGCGATTCTGCCTCGACCTCTCM 
10 TGGCTMTTTTGTATTTTTAGTAGAGACAGGG 

CCGCCTTGGCCTCCCAMGTGCTGGGATTACAGGCGCGAGCCACCGCGCCCAGCCTAGACCACTTTTAGTACCAACA 

gtgatttcatggaccctamgcagacctgacactgatccagatttgcagtcc 

agtcmgcagctttctctggaamtgaatgctmttagtgtgmccaaaagagtaagtm 

aaagcttamtggamgtcactggtcctcccctccgcacaggaaaggtacccagtagatmtgmccam 
15 ctccagccagmgttamcatctgggatatgacgtcttcatgccaggggcactcatttcttagcagcctctc 

tcaggtggtgccmgaggcacaccaggtagagcamcttagcagctctgactmcaggctgcamgtgcm 

ggcagagatttggagggcacccacgtccagactgcttcccgtccmgttaccaggacagctcaaamcato 

cccatggctctaggaaamgtgacactmgccmcacctttctttatgtgggagcaamtc 

ttgtggggcaggcaccccactggctgcagctagcccaccataggcacagcacatcccaccactctccttccagtc 
20 ccccagccggcmcttctaccgagagccatggctcmcaccamctggacagtagacatca 

tacagaccccaccagtacagcttgacagctcccggcaccatccctrccttcatctgacttattgaacw 

caccagcaccamgaattmgtcmctmccikkcto 

tgcctttgatcatttctggaccgtaggaaamggmtagcaatc 

gtttcttmcctaaagtcmggccttggactcttccctgagggttgc 
25 ccttactgcamtgtgttagctctmcatctcccacaagctaaaggmcttc 

tgtgtttggttaamtatacagcacattgtgatmcatamgtggatccatcttgtatcatta 

atttttatgtatggttttatgtactgtacmgtmcttattcttgmtm 

GTGMTTAAAMGTTTCCAAMTCTTGAAAAAAAAAAAAAAAAAAAA 

425 .... 

30 MMAMPIQQNGTHTGVPIDLDPPDSRKRPL^ 

LSKLSKSKDFYPGTTERVCLIQGTVEALNAVHGFIAEKIREMPQNVAKTEPVSILQPQTTVNPDRIKQTLPSSPTTTKSSPSDP 
MTTSRANQVKI I VPNSTAGLI IGKGGATVKAVMEQSGAWVQLSQKPDGINLQERWTVSGEPEQNRKAVELI IQKIQEDPQSGS 
CLNI S YANVTG PVANSNPTG S PYANTAEVLPTAAAAAGLLGHANLAGVAAFPAVL SGFTGNDLVAITSALNTLAS YGYNLNTLG 
LGLSQAMTGALAAAMSMPAAAAANLLATYASEASASGSTAGGTAGTFALGSLAAATAATNGYFGM 

35 IXSSKDVVEIAVPENLVGAILGKGGKTLVEYQELTGARIQISKKGEFVPGTRNRKVTITGTPMTQMQY 
NPQKVG 

426 

GMTTCCGACAAMCAAMGGGAGMCCTTCTCCCGGTAGCAGCGGCAGGAACTGCAAACATGATGGCGGCAGCTCCCATCCAG 
CAGAACGGGACCCACACTGGGGTTCCCATAGACCTGGACCCGCCGGACTCGCGGAAAAGGCCGCTGGAAGCCCCCCCTGAAGCC 
40 GGCAGCACCAAGAGGACCMTACGGGCGAAGACGGCCAGTATTTTCTAMGGTTCTCATACCTAGTTATGCTGCTO 
ATTGGGMGGGAGGACAGACAATTGTTCAGTTGCAAAM^ 

GATTTTTACCCAGGTACTACTGAGCGAGTGTGCTTGATCCAGGGMCGGTTGMGCACTC 
GAAAAAATTCGAGAMTGCCCCAAMTGTGGCCMGACAGAACCAGTCAGCATTCTACAAC 
CGCATCAMCAMCATTGCCATCTTCCCCMCTACCACCMGTCCTCTCCATCTGAOTCCCATGACCAC 
45 GTAAAGATTATAGTTCCCMCAGCACAGCAGGTCTGATMTAGGGM 
GGGGCTTGGGTGCAGCTTTCCCAGAMCCTGATGGGATC 

MCCGAAMGCTGTTGMCTTATCATCCAGMGATACMGAGGATCCACAMGTGGCAGCTGTCTCM 
GTGACAC^TCCAGTGGCAAATTCCMTCCMCCGGATCTCOT 
GCAGGGCTATTAGGACATGCTMCCTTGCTGGCGTTGCAGCCTTTCCAGCAGTTTT 
50 GCCATCACCTCTGCACTTMTACATTAGCCAGCTATGGATATMTCTC 
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ACAGGGGCTTTGGCTGCAGCAGCTGCCAGTGCCMCCCAGCAGCAGCAGCAGCCMTTO^ 
TCAGCCAGTGGCAGCACAGCTGGTGGTACGGCGGGGACATTTGCA 
TATTTTGGAGCTGCTTCTCCCCTAGCTGCCAGTGCCATO 
ATAGCAGTGCCAGAAMCTTAGTTGGTGCMTACTTGGCAM 
5 AGGATACAGATCTCCAAAAMGGAGMTTCGTACCTGGCACMGGMTCGGMGGTMCCATTACTGGM 
CAGGCTGCTCMTATTTMTTACACAMGGATC^ 
CCCCAGTTACACATCAGATTGTTTTMCCCCTCCTTTACCCCATTTT 
TTCTGTTATTMTATATMTTATGCAAATGAATC^ 
TTTCATAGAAAGAATTTTTTCTTGATCTGTTTT^ 
10 ATTTMGCCTGCCATTTTACCAGCATTATTGTAGTTTMTGATTC 

tctagatmcactamttmctactgttaggttgagtatggtggggtc 
tgtmgtcttacttcctgcttagggcacagtgmgta^ 
gcmtatagttgcataamgcactgtaamtatttaam^ 
amgcacctgttctccatctgmctacacmtggamtmtgctgca 
15 acaamgttctcacatggagtcccacctcttcagaggttgcacatttc 
. ggccgagacagtcattactccattaacatcctcactgttta 

GAMGTGCCMCAMTTATTGATAGCTGATAATGTTO 
ATTGTGTATMTTTGTTAACACTAGAMCCTAT^ 

TGCTCCCATGATGGAGTCTTTGCAGCATGGCGCTAGCAGCCMTGCAGTTTCTMTAC 
20 ATTTTTATGTCACCMCCAGACAGTATTTCCTGCATGCTTATTTAGMGAGGCAGCTTATCTTC 

TTGTCAGGCTTTTTGACAGGTCATTTCAGAGTMGCCTTTC 

GAGTGCCMCTGTCCAGGCCATTTGACACACCATCTGTTMCCTCTGAGTTTGCCCA 

GCCCTGTGCACTCAGCACTCATAGGATCATCCAGACTCTCATGCGGCATGC^ 

ATATCTGGCTGAGCMCTGMTTACAAMGAGMTTACTTO 
25 AMTGGCATCCCAGCTGCTCCTTTCTGTGCAACCMTTAMGAACMTGAGTGTGATGCTC 

CTCTCTGMCTGTGATCTTTGTCCTC^ 

TGTGCMTTTACTTTTGGACATTGAGMCTTGAMTTATTTCCTGATC 
MCTGTMGAGTAGACTCATTTTTTTTTTm 

TTTATTTAGAGAATGATTTTAAAAGGGMTGATATGCTTGTTTAMTGAMGAGA 
30 ATGACTATTTATCGTCGATACTCTC^ 

TATTATATTTACTCATTTTATCATTATAAATGTGTTTAGTTCATCATGTAGCATCAAAA 

427 

MAFIRKKQQEQQLQLYSKERFSLLLLl^EEYYFEQHRMHILHKGSHHERKIRGSLKICSKSVIFEPDSISQPIIKIPLR^ 
IGKHGENGMRHFTKAKSGGISLIFSQWFIKEH3WVAPYKIERGKMEYWELDVPGKVEDWETLLQLH 

35 ITAILQSRLARTSFDKNRFQNISEKLHMECKAEM^ 

LGLEVFCTEDDLCSDIYLKFYEPQDRDDLYFYIATYLEHOTAEHTAESYMLQWQRGHLSNYQYLLHLNNLADRSCNDLSQYP 
PWIIHDYSSSELDLSNPGTFRDLSKPVGALNKmERLLTRYQEMPEPKFMYGSHYSSPGmFYLVRIAPEYMLCLQNGRFDN 
ADRMFN S I AETWKNCLDGATDFKELI PEFYGDDVSFLVNSLKLDLGKRQGGQMVDDVELPPWAS SPEDFLQKSKDALE SNYVSE 
HLHEMIDLIFGYKQKGSDAVGAHNWHPLTYEGGTO^ 

40 YNASMDSPGEESFEDLTEESKTLAWNNITKLQLHEH 

MAL S SCLLLPGDATVITS SWDNNVYFYS I AFGRRQDTLMGHDDAVSK^ 

liAELEHDVSVDTISLNMSTLLVSGTKEGTVNIWDLTTATLMHQIPCHSGIVCm 

SSMTSDEPQTCFVVTONS\fljSGSQSGELLVTOLI^ 

428 

45 GMTTCGCCTCGATGGCGTTTATCCGGMGMGCAGC&GGAGCAGC 

CTGCTTMCTTGGAGGAGTACTACTTTGMCAGCATAGAGCCM 

AGAGGCTCCTTAAAMTATGTTCAAMTCGGTGATTTTTGMCCAGATTCMTATC 

GACTGTATAAAAATAGGAAAGCATGGAGAAMTGGAGCCAATAGACACTO 

TTCAGTCAGGTATATTTCATTAAAGMCATAATGTTGTTGCACCATATAAM 
50 TTGGATGTTCCCGGGAMGTGGAAGATGTTGTGGAGACGTTGCT^ 
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CAMCCGCCATGATMCAGCTATTTTGCAGTCTCGTTTAGCTAGMCATCATTTGACAA 
MGCTGCACATGGMTGCAMGCAGAMTGGTGACGCCTCTGGTGACTMTCCTGGACACGTGTC 
TATTTTCAGCCCCTCAACGGCTACCCGAAACCTGTGGTCCAGATMCA 
GGCCTCATGCCTCTGGGCTTGGMGTATTTTGCACAGMGATGATCTC 
5 GATAGAGATGATCTCTATTTTTACATTGCCACATACCTAGAGCACCATGTC^ 
TGGCAGCGTGGACACCTTTCCMCTATCAGTACCTC 

TACCCTGTGTTTCCATGGATMTACATGATTATTCCAGCTCAGMCTAGATTTC 

MGCCAGTAGGGGCCCTAAATMGGAACGGCTGGAGAGACTACTGACACGCTACCAGGAMTGCCTGMC 

GGGAGTCACTACTCTTCCCCGGGTTATGTACTTTTTTATCTTOT 

1 0 AGATTTGATMTGCAGATAGMTGTTCMCAGTATTGCAGAMCTTG^ 

TTMTTCCAGAATTCTATGGTGATGATGTGAGCTTTCTAGTCMTAGCCTC 

ATGGTTGACGACGTGGAGCTTCCCCCTTGGGCTTCCAGTCCCGAGGAC 

TATGTGTCTGMCACCTTCACGAGTGGATTGATCTMTATTTGGCTACAMCA 
GTATTTCATCCCCTGACCTATGMGGAGGTGT^^ 

1 5 ATCTTGGAATTTGGGCAGACACCAAMCMCTATTTGTGA^ 

CAGACCTCCAGTTATMTGCTTCTATGGCAGATTCCCCAGGTGMGAGTC 

GCCTGGMTAACATCACCAMCTGCAGTTACACGAGCACTATA 

AATGGATCTTCAGTATTCACMCATCCCAAGATTCCACCTTGAAGATC 

TCATTTTCAMTATGGCTTTATCGTCTTGTOT 
20 TATTTTTATTCCATAGCATTTGGAAGACGCCAGGACACGTTMTGGGACATGATC 

MCAGGCTATATTCTGCATCGTGGGACTCTACAGTGMGGTC^ 

CACTTTGACTTGCTC&CCGAGCTGGMCATGATGTCAGTGTAGA 

GGCACCAAAGAAGGCACAGTGAATATTTGGGACCTCACMCGGCCACCTTAATGCACCAGATTC 

TGTGACACTGCTTTTAGCCCAGATAGTCGCCATGTCCTCAGCACAGGMCAGATGGCTGTCTTAATGTCATO 
25 GGMTGCTCATCTCCTCCATGACATCAGATGAGCCCCAGACGTGCTTTGTCTGG 

TCTGGTGMCTGCTCGTTTGGGACCTCCTTGGAGCAAAMTCAGTGAGAGM 

TGGATGMTGMCAGTGTAGCAGTATCATCACAGGAGGGGMGACAGACAMTTATATTCTGG 

TTTCCTCTCCTGAATATTAMTTGMCTCTATTTMTGCATTTTT 

AGTMTTAGMGTTTTACCACATGGAAMTTTGTGGTTTTAMCTTO 
30 CATTCTCTTAGGGCAGATAAMTGCGGCTGTGmGGAAAMCATGTTACACTGTMGGCAGATGATCGTC 

TGTCAGMGACAGGACTMGTAGCAGAGMTAGCTMGAGATAMTTGGGCTGGGGAMCTTC 

AGAMTTTTCCMGAAMTGTGCAGTATTCTCTGCTACTTCTGAATC 

CGGGTTTTGGGTGTGTGTTTTCATAGCGTGGTTACT^ 

GTTCTTAGTMGTTTACTGTGTATAGGMCGGTTTGTATTTCATTACAGCTATTC 
35 TAAAAAAAAAAAAAAAAAAA 

429 

MLALRCGSRWLGLLSVPRSVPLRLPMRACSKGSGDPSSSSSSG 

EKGFGYKGSTFHRVIPSFMCQAGDFTNMGTGGKSIYGSRFPDENFTLKWGPGVLSMMAGPNTNGSQFFICTIKTO 
WFGHVKEGMDWKKIESFGSKSGRTSKKIVITDCGQLS 

40 430 

GMTTCCGGAGTTCCGGGCGCGCGCGACGTCAGTTTGAGTTCTGTGTTCTCCCCGCCCGTGTC 
TGCTGGCGCTGCGCTGCGGCTCCCGCTGGCTCGGCCTGCTCTCCG 

CCTGCAGCMGGGCTCCGGCGACCCGTCCTCTTCCTCCTCCTCCGGGMCCCGC'TCGTGTACCTGGA 
AGCCGCTCGGCCGCGTGGTGCTGGAGCTGMGGCAGATGTCGTCC 
45 AGMGGGCTTCGGCTACAAAGGCTCCACCTTCCACAGGGTGATCCCTTCCTTC 
ATGGCACAGGCGGGMGTCCATCTACGGAAGCCGCTTTCCTGACGAGMCm 
CCATGGCTMTGCTCX5TCCTMCACCMCGGCTCCCAGTTCTTCATCTGCACCAT 
TTGTGTTCGGTCACGTCAMGAGGGCATGGACGTCGTGMGAAMTAGAATC 
AGATTGTCATCACAGACTGTGGCCAGTTGAGCTMTCTGTGGCCAGGGTGCTGGCATGGTGGCA 
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CAGGTGGCCGCGTTGGGCTGTCAGCCMGGTGCCTGAMCGATACGTGTGCCCACTCCACTC 
TGCTAGGGATGTTAGACGGAATTCC 

431 

MEPKRIREGYLVKKGSVFNTWKPMWWLLEIX5IEFYKKKSDNSPKGMIPLKGSTLTSPCQDFGKRMFVFK 
5 FLEERDAWVRDINKAIKCIEGGQKFARKSTRRSIR^ 
QSVRNRQEGUftASSLLNEGYI^PAGDMSKSAVDGT^ 
KQGHRRKNWKVRKFILREDPAYLHYYDPAGAEDPLGAIHLRGCW^ 
EWIKAIQMASRTGK 

432 

10 ggcccagctgctgagaggagttgcctgagagtgacct™ 

ggctaccttgtgaagmggggagcgtgttcmtacgtggamcccatgtgggm 

mgmgaamgtgacmcagccccamggmtgatcccgctgamgggagcactctgactagcccttgtc 

aggatgtttgtgtttmgatcactacgaccamcagcaggaccacttc 

gttcgggatatcmtmggccattaaatgcattgm 
1 5 ccagaaaccattgacttaggtgccttatatttgtcc^ 

aagaagatttttaatcactgcttcaca 

ggcctcatgattgcttcatcgctgctcmtgaggggtatctgcagcc 

gctgaaaaccctttcctggacmccctgatgcc 

GATGATGTGATTCTGAMGMGMTTCAGAGGGGTCATTATCMGCAGGGATGTTTACTGMGCA 
20 TGGAMGTGAGGMGTTCATCTTGAGAGAAGACCCTGCCTACCTGCACTACTATGACCCTGCTGGGGCAGMGA 
GCMTTCACTTGAGAGGCTGTGTGGTGACTTCAGTC^AGAGCMCTCAAATGGCA 

ATCATCACAGCAGATGMGTGCACTATTTCTTGCMGCAGCCACCCCCAAGGAGCGCACAGAGTGGATCAMGCCATC 
GCCTCCCGMCTGGGMGTAMGAGACTCCTGCATTCCTTC 
TGTCCACTTCTGTGACAMTCMCGGGAMCAGCrc^^ 
25 TGTGGTGTGCMGGTTCCCCTGCATTGTATTGCTCACTGCAGCCCCTCTGCCCCTATC 

CTGTGCAGCCTCAGTAGGCTGCTTGCCCTCTCCAGCCTCAGGGCCTCTTCTGGAAMTGM 

AGGTCCCCCTAGCTTAAAAAAAAAAAMTCTGCCCCATGATTCTMCACTCGCAGTAGTGATAGTGTATCTAGTTGTO 
GTGTCCTTCCTTGGCTMGTCTTGGCCTTCAGTTATCTTCAMTGTACCAGMCCTG^ 
CTGATCTGTAGTACAGTACCAGGMGAMCCTCTTTTGTTCTCTTTAGACATCTTC 
30 ACAAMTGMGGAGGGCTCTCTTCTTTCT 

TTCTATTACTTTTGGCAGCCTGGAAAGTTGTGTCTTCTC 
TGCCCTTAAACCGCCCAGAGGAGCCCTATTCCACTCTGGTTTTA^ 

atttctctmcgtttattacmttaggagggggaccccacatctgtgagattctg 

gtggccagatgtgttccccccatgggtgagaggcctcggcmctgcctggtgmtgtc 
35 gggctgmctactggccagctcactggatgatgggttmt^ 

agaggggctgttagcattgcgcagcatcttcagttctccagtamtgatattgcg™ 

agcagctcctgcttgagttctgagggcatcatggccctatc^^ 

ttgcmtttttgaccctgaccactmctagtgato 

gaaagtatatttttcmttccagatttttmttttm 
40 mtcctctcttgccctccctcmttcccctgtmcattcctgmgctgttccca 

gtaaagmctgtctttttctctgattctttmtamttatct™ 

atttattmtgggaagtcmcttmtgttttgamtaaatatatgactc 

433 

MLLQSQTMGVSHSFTPKGITI PQREKPGHMYQNEDYLQNGLPTETTVLGTVQI LCCLL I S SLGAILVFAPYPSHFNPAI STTLM 
45 SGYPFLGALCFGITGSLSIISGKQSTKPFDLSSLTSNAVSSWAGAGLFIi^SMVALRTASQHCGSE^YLSSLPYSEmP^ 
YEI KDCLLTSVSLTGVL WML I FTVLELLLAAYS SVFWWKQLYSNNPGS SF S STQSQDH IQQVKKS SSRSWI 

434 
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MGTCTCAGAGGCTGGAGAGCAGAGCACCMGATCGTTCTGGCAGGMCAGCCAGTG<^AGGTTCCAGCTGAGCGCTCCCCAGA 
GGTGAGCTGATCCCCAGCCACAGCACACAGGACCAGGCTGCGAGMCAGCATCATCAGCATCATGCTATTACAATCCCAAACCA 
TGGGGGTTTCTCACAGCTTTACACCAMGGGCATCACTATCCCTCAMGAGAGA^ 
ACCTGCAGMCGGGCTGCCMCAGAMCCACCGTTCTTGGGACAGTCCAGATCCTGTGTTGCCTC 
5 CCATCTTGGTTTTTGCTCCCTACCCCTCCCACTTCM 
CTCTGTGTTTTGGCATTACTGGATCCCT 

CAMTGCAGTGAGTTCTGTTACTGCAGGAGCAGGCCTCTTCCTCCTTGCTGACAGCATGGTAGCCCTC 
ATTGTGGCTCAGAMTGGATTATCTATCCTCATTGCCTTATTCGGAGTACTATTATCCM 
TGACCAGTGTCAGTTTMCAGGTGTCCTAGTGGTGATGCTCATCTTCACTGTG 
10 TCTTTTGGTGGAAACAGCTCTACTCCMCMCCCTGGGAGTTCATTTTCCTC 
AAMGAGTTCTTCACGGTCTTGGATATMGTMCTCTTC^ 
ATTAGACTTTCCTGAMTCTCTGCCATTTTAGATACTGTGAMCAMCTAAAAAAA 
TATGAGTCGTTATTTMTTTCTCTTGAAMTMTTTCCTCAMGCCCMGTCAATAAA 

435 

15 MVNVPKTRRTFCKKCGKHQPHKVTQYKKGKDSLYAQGRRRYDRK^ 
IKRCKHFELGGDKKRKGQVIQF 

436 

GGCGAGAGCTGCGAMGGCGAGAGCTGCGAAGGGCCAGGTGTCGGGCGCTGTTTCTCGmTCATCATATAGAC 
TGCTGCAMGATGGTCMCGTACCTAAMCCCGMGMCCTTCTGTM 
20 GTATMGMGGGCMGGATTCTmTATGCCCAGGGM 

GCCMTTTTCCGGAAGAAGGCTMGACCACAMGMGATTGTGCTMGGCTGGMTGTGTTG 
GATGCTGGCCATTMGAGATGCMGCATTTO3AACTGGGAGGAGATAAGAAGAGAMGGGCCMGTGATC 
GGGATATTTTTCTTCMTTTTGMGAGAAMTGGTGMGCCATAGAAM 
ACGCAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

25 437 

matngskvaixjqistevseapvandkpktlwkvqkkmdlpdrdtwkgrfdflm 
pyfltlifagvplfllecslgqytsigglgvwklapmfkgw 
ntdrcfsitcsmvnttnmtsawefwe 
ilffrgwlpgakegilfyitpnfrklsdsevwldmtqiffsygi^ 
30 vifsivgfmahvtkrsiadvmsgpglaflaypeavtqlpisplwailffsblllmlgidsqfctveg 
elfimvciisyliglsnitqggiyvfklfdyysasgmsllflvffe^ 
piivagvfifsavqmtpltmgnwfpkwgqgvgwlmalssmvlipgymymflalkgslkqriqvmvqp 
agsstskeayi 

438 

35 GMTTCCGCTCCGGCCGCAGGATCTCCCCMGGTGGCAGMGGAGGCOT 

' GAGGCMCTCCMGGTCCTACTCTCTTTCTGTGCCTGTTACCCACCCCGTCCTCCTAGGGTGCCC " 
TCCACGTGGACCGGGGGTGACATCGCACGTCCATCTGCCAGGACCCCTGCGTCCAM 
MGGTGGCCGACGGGCAGATCTCCACCGAGGTCAGCGAGGCCCCOT 

CAGAAGMGGCGGCAGACCTCCCCGACCGGGACACGTGGMGGGCCGCTTCGACTTCCTCATGTCCT^ 
40 GGCCTGGGCMCGTCTGGAGGTTCCCCTATCTCTGCGGGAAAMTGGTGGGGGAGCCTTCCTG 

ATCTTTGCGGGGGTCCCACTCTPCCTGCTGGAGTGCTCCCTGGGCCAGTACACCTC 

GCTCCTATGTTCMGGGCGTGGGCCTTGCGGCTGCTGTGCTATCATTCTGG 

GCCATTTACTACCTGTACMCTCCTTCACCACGACAC^ 

TCCAACTACAGCATGGTCAACACTACC^CATGACCAGCGCTGTGGTGG 
45 GGGCTGGATMGCCAGGTCAGATCCGCTGGCCACTGGCCATCACGCTGGCCATCGCCTGGATCCTTC 

AAGGGTGTTGGCTGGACTGGAMGGTGGTCTACTTTTC 

GTGACGCTGCCCGGGGCCMGGAGGGCATCCTCTTCTACATCACACCCMCTTCCGCMGCTGTC 
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GATGCGGCMCCCAGATCTTCTTCTCATACGGGCTGGGCCTGGGGTCCCTGATCGCTCTCG 
MTGTCTACAGGGACTCCATCATCGTCTGCTGCATCMTTCGTGCACCAGCATGTTCGCAGGATTCG 
GGCTTCATGGCCCATGTCACCAAGAGGTCCATTGCTGATGTGGCCGCCTCAGGCCCCGGGCTGGCGT^ 
GCGGTGACCCAGCTGCCTATCTCCCCACTCTGGGCCATCCTCTTCTTCTCCATGCTGTO 
5 TGCACTGTGGAGGGCTTCATCACAGCCCTGGTGGATGAGTACCCCAGGCTCCTCCGCMCCGCAGAGAGCT 
GTCTGCATCATCTCCTACCTGATCGGTCTCTCTMC^ 
GCCAGTGGCATGAGCCTGCTGTTCCTCGTGTTCTTTGAATGTGTCTC 
MTATCCMGAGATGGTTGGATCCAGGCCCTGCATCTGGTGGAMCTCTGCTGGTCm 
GTGTTCATTTTCAGTGCTGTGCAGATGACGCCACTCACCATGGGAMCTATGTTTTCC 

10 CTGATGGCTCTGTCTTCCATGGTCCTCATCCCCGGGTACATGGCCTACATGTTCCTCGC 

ATCCMGTCATGGTCCAGCCCAGCGMGACACTGTTCGCCCAGAGMTGGTCCTGAGCACGCCCAGGCGGGCAGCTC 
AAGGAGGCCTACATCTAGGGTGGGGGCCACTCACCGACCCGACACTCTCACCCCCCGACCTGGCTGAGTGCG^ 
TGTCTGAGGATACCTTCCATCTCMCCTACCTCGAGTGGTGATCCAGACACCATCACCACGCAGAGAGGGGAGGTG 
GTTAGACCCCTGGGTGGGCCCTGCCGTGGGCMGGATACCCGGTGGCTTCTGGCACTGGC 

1 5 GCCCCATCAGCATCCCACTCCTGGCGGGAT 

439 

MEGYSEEASLLRHLEKVASEEEEVPLVVYLKENMLLTANGLHLSQNREAQQSSPAPPPAEVHSPAADVNQNIA 
SNSSHNPPATDVNQNPPATWPQSLPLSSIQQNSSEAQL^ 

TFSREATLIPSSRPPASDFMSSSLLIDIQPNTLWSADQEMSGRAMTTPTKVYSEVHFTLAKPPSV^ 
20 QMERSPMLERRHFGEKAPAPQPPSLPDRSPRPQRHIMSRSPMVERRMGQRSPASERRPLGNFTA 

PPSYSVLYPSSDPKSSHLKGQAVPASKTGILEESMARRGSRKSMFTFVEKPKVTPNPDLLDLVQTADEKRRQRDQGEVGVEEEP 

FALGAEASNFQQEPAPRDRASPAAAEEWPEWASCLKSPRIQAKPKPKPNQNL^ 

LMCPSPTMSLPSSWKYPTNAPGAFRVASRSPARTPPASLY^ 

SPMPSSLDLVPlSnjPKGALPPSPALPRPSRSSPGLYTSPGQDSLQPTAVSPPYGGDISPVSPSRAWSPRAKQAPRPSFSTRNAG 
25 IEAQDRRESLPTSPPWTPGASRPPSSLDGWVSPGPWEPGRGSSMSSPPPLPPPPPMSPSWSERSVSPLRPETEARPPSRQLQAL 
LARNIINAARRKSASPRSAGAENPRPFSPPRAPPPPPPPPPPPPIWSPQPARPGSMW 
PLPAPPRPFLYRRSPTDSDVSLDSEDSGAKSPGILGYNICPRGWNGSLRLKRGSLPAEASCTT 

440 ' ' . . . . 

GAGAAAAGTCACATCCAGCTCCTTCATGTTGCTGCCGATGTGGGATTTTCCTGGCTTCGTC 

30 CCTOAGGCTmAGAACC^^ 

GGGCCGGAGCACTAGCCTCACGGAGMGGATCTGAMGMGCCMGGCGCGGAGCCAGCAGATTGCAGCC 
TCCCAGCTCCMTTCCCGTGGCGTCCAGCTCTTCMCAGGCGCCGGCAGAGGGTGMCGAGTTCACCTTGGAGAGCCACGGCC 
GAGGGGACAGMGCCCAGCCAGGAGTCCCTCAGAGTGCTCCCTTCMGCCTCCCAGGCGATGCACCGGGGCTCAGCCTGAGTTC 
CACCTCGCTGCCGGAGCCAGGCCCTCCACGGCACCCCAGTCCCCAGAGCCCCGACAGAGGGGTCCCTGGCCACAGCATGGAGGG 

35 GTACTCAGAGGAGGCTAGCTTGCTGCGGCACCTGGAGMGGTGGCCAGTGAGGAGGMGAGGTACCACTGGT^ 
GGAGMTGCAGCACTGCTGACAGCCMTGGGCTGCACCTGTCCCAAMCCGAGAGGCCCAGCAGTCCyrc 
AGCTGAGGTCCACAGCCCAGCTGCAGATGTCMCCAAMCCTTGCCTCGCCCAGTGCCACGCTCACCACACCAACTTCTAACAG 
CAGCCACMTCCGCCAGCCACCGATGTCMTCAGMCCCACCGGCMCTGTTGTCCCACAGAGCCTGCCACTTTCTA^ 
ACAGMTTCCTCAGAGGCCCMCTCCCATCTMTGGCACAGGGCCTGCTTCCAMCCCAGCACC 

40 CCAGGCACCGGCTGAGGAGGTGAGATGCAGCACACTCCTMTTGACAAGGTATCAACTCCAGCTACCACCACCAGCACCTTC 
' CAGAGMGCTACGCTCATCCCCAGCTCCAGGCCCCCAGCCTCAGATTTCATGTCCAGCTCCCTGC 
CACCCTAGTGGTGTCAGCAGATCMGAGATGTCTGGGCGAGCAGCTGCCACCACGCCCACCM 
CACACTGGCCAAGCCCCCATCAGTGGTCMCAGGACGGCCAGGCCTTTTGGGATCCAGGCGCCAGGGGGCA 
GAGGAGCCCCATGCTAGAGAGACGACATTTTGGGGAGMGGCCCCGGCTCCCCAGCCCCCCAGTTT^ 

45 GCCACAGAGACACATMTGTCCCGCAGCCCCATGGTGGAAAGGAGGATGATGGGGCAGCGAAGCCCX^CCTCAGAGAGACGCCC 
CTTGGGGMCTTCACTGCACCCCCCACCTACACTGAGACCTTGTCCACAGCCCCTCTGGCTTC 
ATATTCTGTCCTGTATCCCAGCTCCGACCCCMGTCTTCTCATC 

GGAGGAGTCGATGGCCCGCCGGGGCAGCCGCAMTCCATGTTTACTTTCGTGGAGAAGCCCMGGT^ 
GCTGGATCTXSGTACAGACAGCGGATGAGMGCGGCGGCAGAGGGACCAGGGGGAGGTAGGCGTGG^ 
50 GGGGGCCGAGGCCTCCMCTTCCAGCAGGAGCCAGCACCTCGTGACAGGGCCAGCCCCGCGGCGGCGGAGGAGGTGGTACCAGA 
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GTGGGCCTCCTGCCTCMGTCACCCCGCATC 

GGGAGCTGAGCTCTA^CCCGCCGCCAGTCACGGATGGAGAMTATGTCATCGAGTCTTC 
CTGCCCATCACCTACCATGTCCCTGCCTTCCTCCTGG 

CCCAGCCCGGACCCCGCCTGCCTCCCTCTACCATGGCTACCTGCCTGAGMCGGGGTCCTGCGCCCAGAGCCCA 
5 GCCATACCAGCTGCGCCCCTCGCTCTTTGTCCTCTCACCTATCMGGAGCCTGCCMGGTCTCACCMGA^ 
CMGCCCAGCTCCTTGGACCTGGTGCCCAACCTGCCCMGGGGGCTCTCCCTCCATCTCCTG 
CTCACCGGGCCTCTACACCTCCCCCGGCC^^ 
CGTGTCTCCCTCCAGGGCGTGGTCTCCCCGA^^ 

TCAGGACCGCCGGGAGAGCCTGCCCACCTCCCCACCCTGGACGCCGGGCGCGTCCCGGCCCCCCAGCAGCCTAGACGGCTGGGT 

10 GAGCCCGGGCCCGTGGGAGCCAGGTCGCGGGAGCAGCATGAGC 

GTGGAGCGAGCGCTCGGTGTCCCCGCTGCGACCTGAGACCGAGGCGCGGCCCCCCAGCCGCCAGCTGCAGGCG^ 
AMCATCATCMTGCGGCCCGGCGCMGAGCGCCTCCCCGCGGTCGGCGGGCGCCGAGMCCCGCGGCCCTTCTCCCCGCCGAO 
GGCGCCACCGCCCCCGCCCCCGCCCCCGCCCCCGCCCCCGCGCATGCGCTCGCCACAGCCCGCCCGCCCCGGCTCGGCTGCTGT 
GCCGGGGGCAGCCTTCGCGCCCATCCCGCGGAGCCCGTTGCCCGCCGGTCCTTCGTCCTGCACCAGTCCCCGGAGCCCG^ 

15 CGCGCCTCCCAGGCCCTTCCTCTACCGCCGCTCGCCCACGGACTCCGACGTGTCCCTCGACTCCGAGGACTCCGGGGCTMGTC 
TCCAGGCATCCTGGGCTACMTATCTGTCCCCGCGGGTGGMTGGCAGCCTTCGGCTCM 

CTCCTGCACCACCTAGAGCCCCACCCCCGACCCCACCCCGGGAGGGCAGAGCCAGAAGMGGCTCATTAGACCTGGGGGACCCA 

MGGGTCTGGCCTCTTTGGGCAGCCCCAGAGATGAGGGGTCAGCAGAGGAGAGCTCTGGGGTTC 

GCTTGAGTTCTAGCCCTTGCTCTCATTCAGCTGTTGTGTGACCCTGGGTAAGACCCTTC 
20 CTGTTTMTGGTGGCTTTGGCCMGGCMTCCACAMCGTC 

TCTCTCTTTCTCTCTCTCTCTCTCTCACACACACACACACACACACACACACACACACACACACACACACACACTAGTTAGTGC 

CTTGGATGAGGCGGGGCAGTGTGTATATGGACCCCTGGACTTGCTACCTTCAGGGTT'CCATACTC 

TGCTGTCTGGAGTCTGGCMGCGGGGTGTGTTCAGMGG 

CTCCCTGGCACCMGTCCCAGGCAGGAGCAGCTGTTTTO 
25 AGAGGTCTCCCAGGCCAGCTCMGGTGTCCCACTATCCCCTCTGGAGGGMGAGGCAGGAAMTTCTCCCCGGGTC 

GCTACTTTCTCCATCCCAGTTCAGACTGTCCAGGACATCTTATCTC 

GCCCAGGACTTGGGCCTCCAGCTCATCTGTTO 

AAGAGATACAGAGCTACCATGTGACTTTACCTGAm 

CTTGTTACGGGAAATATGAAMGCATGGCCAGGATGCATAGAGGAGATTCTAGCAGGGGACAGGATTGG^ 
30 AGGGCTCTTCCAGTCTTGAAATGCATTCCATGATATTAGGMGTCGGGGGTGGGTC 
TAGGGGCCGATGAGCTTGGGTACGTGAGCAGGGTGTTMGTTAGGGT 
CTTCAGTGTCAGACCTCAGTCCCAGTGTCCATATCGTO 

TCTGACCTAGCTAGTGCCTGGTGCCCAGTGACCTGGGGGAGCCTGGCTGCAGGCCCTCACTGGTTC 

TGATTCAGGTCCCCAGGGGGGACTCAGGGAGGMTATGGCTGAGTTCTGTAGTTTCCAGAGTTGGCTGGTAGAGCCTO 
35 GTTCAGMTATTAGCTTCAGGATCAGCTGGGGGTATGGMTTGGCTGAGGATCAAACGTATGTAGGTGAM 

TTGCTAMGGTGAGGGACAGTTTGGGTTTGGGACTTACCGGGGTGATGTTA 

TMGGTCAGTATGGMGATAGGGTTGGGACAGGGTGCTTTGGAATGAMGAGTC^ 

GCTCCTGCTGCTGTGMGATGAGAAGGTGCTCTTACTCAGTTM 

GGGTCCCTTGTAGCACAGGAGACTGGGGCTMGGGCCCCTCCCAGGGMGGGACA 
40 TAGAGGATCCATTTCTACCTGCATTTCCCAGAGGACTAGCAGGAGGCAGCCTTGAGAAACCGGCAGTT 

GCTGTTCTCTCATTGTCACTGCCCTCTCCCCMCCTCTCCTCTMCCCACTAGAGATTG^ 

TAGAATGCAGCTCTGGCCCTCAATAAATGCTTCCTGCATT 

441 

MFGFHKPKMYRSIEGCCICRMSSSSRFTDSKRYEKDFQSCFGLHETRSGD^ 
45 SLKTTLKPKKVKTLSGNRIKSNQISKLQKEFKRHNSDAH^ 

RQKICCGI I YKGRFGEVLIDTHLFKPCCSNKKAAAEKPEEQGPEPLPISTQEW 

442 

AGACTATCTTTCTAGACMGGCAGTTGAGGAGGAGGG 
ATTATTGCGCGTGGMCGGCTGCTTTTGGMGGCACMCTTC 
50 TCACCAMCAGCTTCAACCCTGAMCCAGGACGAGAAGTTGACAACATCTGAGTGGACAGOTAATTGACC 
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AGACTATTGCCCAGMGAAMGATGTTTGGTTTTCAC 

TMGTCCTCCAGTTCTCGATTCACTGACAGTAMCGCTATGAAMGGACTTC 

AGGAGACATCTGCAATGCCTGTGTCCTGCTO 

GGTAGATGCMGGGCTGGACCCAGTCTAMGACTACATTGAMCCAMGAMG 
5 CMCCAGATCAGTAMCTGCAGMGGMTTTAMCGTCATMTTC 

ATCTCCTTGTTACAGTMCCAGTCAGATGACGGCTCAGATACAGAGATGGCTTCTGGTTC 
TTTAGATCTCACTTACTGGAAAAGACAGAAGATATC 

ACATCTCTTCMGCCTTGCTGCAGCMTAAGAMGCAGCTGCTGAGMGCCAGAGGAG 
CACTCAGGAGTGGTGACTGAGGTTTTTATGTAGMGGGGMCAAAAAAAAAM^ 
1 0 CTGACCCTCTTTTTITITPTTTTTTT^ 

CACTGCMCTTCCGTCTCCCGGGTTCMGTGATTCTCCTGCCTCAGCCTCCCM 
ACCCGGCTMTTTTTTAGTTTTAGTAGAGACGG^ 

TTGGATACCTGCTATTCTGCCAAMGACMTTTCTAGAGTAGTTTTGAATGGGTTGAm 
CCAGTGTCTAGCTTACTAAAAAMGAGTTGTATATAATATTTM^ 
1 5 AGTGCTCTTTMGTCTTTTTT^ 

GTATGATAMCTGTATGTAGTTTTTAGTCTTTC 

TTTTCCCTCATGATMTAMTTTGTCATMCTCAGTMCATGMCTTGCCCCTAGAGGTAGTTGTTAAT 
AGGTCTTGCCAAGCTTCTGATGATTCACACCTC^ 

GGAGGAGAMGTMTTTCTAMTATACAGAGAGGTMCTTGACTATATATGTTGCATCCTGTGCCTC^ 
20 GATAMGATTTTMTTTATGTAAMCTTCTAMGCAGMTCAMGCTCCTCTTGGGG 

ACCCTGTATGMTAGTACCAMGCATTACCGCATGGTAGAGMCACACTCGATTAAA 

GTGCMGTCTTCAGGATGGCACAAMCAMGGTTMTGCTTCTTGGGGCACATTTCTTAGAGGG 

MTCGACTTTTGTTTGTGTTACATGACTTCTGTC 

TGACCTTTGTGAAGGTTTTTATC^ 
25 ATTAAMGTAAMCTTACTAAMGAAMGAGGTTTCTGTTCACATTAMTGGTTT^ 

GAACATTGAGATATCCTGMCTTAGAGCTCTTCMTCCTMGATTOT 

TCTTACTGCCTCTTTTCTAMTGTTCAGGAAMGCATTGCCAGTTCAGTCTTTTC 

GTMTMCMGACTCAGTGCTTATTTTTTAMCTGCATTTTAAAMTTGGATAGTATMT 

TATAGGCACCCTGTAGTTTTATAGTTCTTMTCTAMCAT^ 
30 CCATATGCACAGACTATACAGTGAGTTGAGTTGGCTCTCCCACAGTCCT 

TCCTGAGTTATTTGAAATGATTTTTTTTG 

CTTCTGTATTTATTTATTTATTACTAGACCTCAACCACAGTCTTCTTTTTC 

GTATGTAGTCATGCACTTTGTATTMTATATTAGAAATCTACAGATCTGTTTO 

AAAACTTTTACTAGGGTATTGAATAMTCTAGTCTTACTAGAAAAAAAAAAAAMGAAAAAAAAAAAA 

35 443 

MSQWCRPVAMDLGVy^LRHFSISFLSSLLGTE^ 

IEKNSQLEQENNLLKTIASPEQLAQFQAQLQTGSPPATTQPQGTTQPPAQPASQGSGPTA 
444 

CGCCTCTTCACGGCACTGGGATCCGCATCTGCCTGGGATCATCM 
40 TTCTGTTTCTCCCTGGGCTGCGGAMGCCAGMGATTTTATCTAGCTTATACM 

AGGGTGTTTTTGGCTGCMTTGCATGAMTCCCMTGGTGTAGACCAGTGGCGATGGATCTA 
TTTCMTTTCTTTCTTGTCATCCTTGC 

TAGCTATTGACAACAAMTCGAGCMGCTATGGATCTAGTGAAAAGCCATTTGATGTA 
TCAMGAGCAMTCAMGMCTMTAGAGAAAMTTCCCAGCTC 

45 AGCAGCTTGCCCAGTTTCAGGCCCAGCTGCAGACTGGCTCCCCCCCTGCCACCACCCA 
CCCAGCCAGCATCGCAGGGCTCAGGACCMCCGCATAGCTGCCTATGCCCCCGCAGM^ 
ACCXSAGMGATGTGCTAGGGAGAATCTGCCTCCACAGTCACCCATTTCATTGC^ 
GCCATTATCTCTTTTCCAGTATTAMCACTCATATGCTTATGGCTTGGAGAM 
GAGMTTAGCATGGATATACCGGGACCTCATGCAGOT 

50 CCTTTCCATCCCTTCCGGCTCCCTACCCCTCAC^ 
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AAGCGATGGAGCTCCTCACTGGACTCGCCTCTCTCCTCTCCTCCCCCCAGGAGGMCTTC 
GAGGGGGMCAGAGTTCACTGTACAMTTTGACMCTGTC 
AMCMTGGATGACACAAMCTATGAGAGTGACAAAATGGTGACAGGTAGCTGGG 
TTGCTTATTGTATATTTGTGTATGTAGTGTAACTATTTTGTACMTAGAGGACTGTMCT 
5 ATTTAGTTGTTTCATTGGCTGTCTGAGGAGGTGTGGAC 

ACCTAAACCCCTTTTMGMTTTGGCACAGTTACTCACTTTGTGTMTCTC 
TTGTTCACTGAAGTCTTTCMTTGAGCTGGTTGMTACTTTGAAAMTGCTCAGT^ 
GGGTTTCTGCATATCACCTGTATAGTAGTTATATGCATATGTTTC 
TTAACTGTTGCACTTGTCMCT'TTCAATAAAGCATATAAATGTTG 

10 446 

TTT-TTTTTTTTTTTTTTTTCACT^ 

GCACACAGTGGGACAGGCCACAGGACAGGCTGGGAGTGMTAMGAGTTCACACTGCTTCCCTGCTTTC 
CTCTAGGCCACCTCCTCCTCAGCCTCCTCCTCGMCTCGCCCTCCTC 

ACCAGGTCATTCATGTTGCTCTCGGCCTCGGTGAACTCCATCTCGTCCATGCCCTCGCCCGTGTAC 
15 CGCGGNAACATGGCCGTGMCTGCTCGGAGATGCGC 

AmTTMCCCCGAGGTGGGATGTCACGACAGCCGTTTTCACATTGTTGGGGGATCACTC 

447 

MLTELEKALNSIIDVYHKYSLIKGNFHAVYR^ • 
KAMKKATKSS . 

20 448 

ATGTCTCTTGTCAGCTGTCTTTCAGMGACCTGGTGGGGNMG 

CTCTATCATCGACGTCTACCACMGTACTCCCTOATAMGGGGAATTTC 

AGAGACCGAGTGTCCTCAGTATATCAGGAAAMGGGTGCAGACGTCTGGTTO 

TMCTTCCAGGAGTTCCTCATTCTGGTGATAAAGATGGGCTGGCAGCCCACAAAAAAAGCCATGMGA^ 
25 CTGAGTTACTGCCCAGAGGCTGGGCCCCTGACATGTACCTGCAGAATMTAMGTCATCMTACCTCAAAAAAAAAAAAAAAM 
AAAA 

449 

MDIRKFFGVIPSGKKLVSETVK^^ 

PPEKLPVSSKPGKISRQDPVTYISETDEEDDFMCKKMSKSKENGRSTNSHLGTSNMKKNEENTKTKNKPLSPIKLTPTSVLDY 
30 FGTGSVQRSNKKMVASKRKELSQNTDESGLNDEAIMQL^^ 

SVQANLSKAEKHKYPHKOTTAQVSDERKSYSPRKQSKYESSKESQQHSKSSADKIGEVSSPKASSKLAIMK^ 

ASKRKENAIKLKGETKTPKKTKSSPAKKESVSPEDSEKKRTNYQAYRSYLNREGPKALGSKEIPKGAENCLEGL 

IERDEMSLIERYGGKVTGNVSKKT^LVMGRDSGQSKSDKAAALGTKIIDEDGLIJ^ 

RTPQKWQGKRKISPSKKESESKKSRPTSKRDSLMTIKKETDVFWKSLDFKEQVAEETSGDSKARN 
35 KYKPTSLKTIIGQQGDQSCAmLRWLRWQKSSS 

VELNASDTRSKSSLKAIVAESLNNTSIKGFYSNGMSSVSTKHALIMDEVDGMAGNEDRGGIQELIGLIKHTKIPIICMCND^ 

HPKIRSLVHYCFDLRFQRPRVEQIKGAMSIAFKEGLKIPPPAMNEIILGANQDIRQVLHNLSMWCARSKALTYDQ^ 

KKDIKMGPFDVARKVFMGEETAHMSLVDKSDLFFH^ 

IRSKQNWSLLPAQAIYASVLPGELMGYMT^ 
40 PLTSQG\TOVQDWALMDTmMKED^ 

SEYNEELNEDDSQSDEKDQDAIETDAMIKKKTKSSKPSKPEKDKEPRKGKGKSSKK 

450 

GTCCTTGCTTCTTTTTTCATTGATGTAMTTT^ 
CCACAGCMTTTCATTCTCTTATCGGAGCTTGTATC 
45 ACCTACAGATACTTATGMGACATTCTTTCGCTATCGTGAGTAAMGTATATAMGCTGTTTC 
AGCCTGTAATAGAMTATTTTTTCATTTTTTAAAAMGGGCGGM 
AGTTCTTGGCGCGMTGGATCCTGAGCCTCGATMC^^ 
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GGGCTGCGATGGACATTCGGAAATTCTTTGGAGTMTACCMGTC 
AMCAMGTCTGATGMGAMCTTTAAMGCAMGAMGGMTAAAG^^ 
TCAMCAAMGCAACCMGCMGAAAMGAGGATCATCTATGATTCAGATTCAGAGTCA^ 
CCAAAMGCCACCAGAAAMCTGCCAGTATCTTCTAMCCTGGTAAMTTTC 
5 CAGATGMGMGATGACTTTATGTGTMGMC^CGGCCTCT^ 

CATCAAACATGAAAMGAATGMGAAAACACTMGACCMGMTMGCCTTTATCACCMTAAM 
TTGATTATTTTGGMCTGGMGTGTCCAMGATCTMTM 

ATGAGTCTGGATTAAATGATGAAGCCATCGCCMGCMTTACAGCTTGATGMGATGCGGAGCTGGAGAG 
ATGMGAGTTTGCCAGMCATTAGCCATGTTGGATGMGMCCCMGACCAAAAAGGCTCG 
1 0 CGTTTTCATCTGTCCMGCCMTTTAAG 

AAAGAMGAGCTACAGTCCTAGGAAGCAMGTAMTATGAMGTTCAAMGMTCTCAGCMCATTO 

amtaggagaagtctcttctcccmggccagttctmgctggcaattatgaamgaamgmgagagctcttatam 
agcctgtggcctcaaamgaamgaamtgccattamttgamggagagacaaamctcctmgaamccaamgttctcca 
ctaaaamgagtctgtmgtcctgaagattctgaamgamcgcactmttatcmgcttatcgaa^ 
15 gtcccmggctctgggctccamgamtaccamgggagctgaam 
tggagtctattgmcgagatgaggccmgtctctmto^ 

attatcttgtcatgggtcgtgatagtggacagtccmgagtgatmggccgcagccttggggacaaamttattgatgmg 

gcctgttgmtctgattcggactatgccaggcmgamtccmgtatga 

mctggagagaacaccccaaaaamtgtccmggaaamgaaamttagtc 

20 ggccgacttccaamgggacagtttggcamgacmtaaaamggamcagatgtgttttggaam 

aggtggctgaggagacaagtggtgacagcmggctaggmtttggctgatgacagcagtgaamcamgtggaamt 
gggtggataaatataagccaacctcgctcaagaccataattggacagcaaggtgaccagagctgtgccaacaaactcctacgct 
ggctccgamctggcaamgagttcttccgmgataaaamcacgcagcaaagtttggtamttttc 
ctagttttamgcagcgttgctgtcaggccctcctggtgttggcaamccaccacagcttccctggtgtgtca 

25 acagctacgtggaactgmtgcaagtgacacccggagtmgagcagtttgmggcgattgttgctgagtcactc • 
gcatcaaaggctottattcamtggagcagcctcttc 

caggcaatgaggataggggaggmttcaggaattmttggcctgataamcatactaamttccca™ 

atagamtcatcccmgattcgctctctggttcamttgttttgatcttcgttt 

gtgctatgatgtctattgcatttamgmggtttamgattccccctccagctatgmtc 
30 atatcagacaggttttacatmtctgagtatgtggtgtgcacgaagtaaagcattm 

acagagccaaaaaggatatcaamtgggcccatttgatgttgcccggamgtgtttgcagctg^ 

cacttgtggacaagtcagatctcttttttcatgattattcm 

ctgtagcagcagggggtgacatgaaamgcacctgatgcttttm^ 

acagccagatccggagtmgcaamctggagtcttctgcctgcgca 
35 gggggtacatgacccagtttcccaccttcccmgctggctggggmgcactcgtctacaggcaaaca 

acctggccttgcatatgagtctcagmcttactccagcaamggactgtaaacatggattatctgtcgcttctaagggatgca^ 

ttgtacagcccttgacctcacmggagtagacggagtacaggatgtottgcacmtggacacatam 

actttgagmtatcatggaaatcagcagctggggtggcamcct 

tcacmgagcttacmtaaggmgcccaccttactccatactcacttcmgctatamggcatctagacacagcacmgccca 
'40 ccctggattcggmtacaatgmgmttamtgmgatgactctc 

ccatgatcmgaaamgacaamtcttcamgccttcaamccagaaaaagatmggagcccagaamggaaaaggaaam 

cgmgamtgaaaccatttttcactagcgacagccactttttactctccctcctga 

gttttttcccagagcmccatgttttagcatmtgggatgacc 

tatgagcagtaggcttatgtacacctcttatagaggttgataggactgcttgggtcctccactc 
45 acgtgcttctgaatgactgtagmttggmctagaaactacaccto 

ggagcagtagtactmggcgtcttttgtaggcttmgaat^ 

ggcagttcctattatgtggtgamttttgtamtamtgattatacaamcagtcaccaccta 

ttgtcagatatctttcaaagtataaattaggmtaaa 

tgtacagcaaaatattatgttggtatattacttccttattaatttgcaaatc 
50 tmcctgttgtgtgamcatmcattggcagtcgaggagtcagmctcggattcccagc^ 

acgtctctgctgtgmtaggatttcatgcagtggtctgttatggggctttgc 

tcttatccttctggagtctcmcccatcaggttcagaccagtaggmccaggctgggtcaggctc 
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GGACAAMTGMCACTGGCTTTTGTTCACTTTGCCCCTGAGAGATCTAGTC 
MGTTTGGGTTATTCCTATMCTGTATGTGTTGAGTGGOT 
TTMTCTTGTTGGTAAATCCTCAGAGGTCTGTGCT 
CCTGCTTTMC&MCTGGTAGTCTTGTTTGGGTC 
5 TGTCATTTATGGAGATTMTTCAmTGGAAATAAAACATTGCCTMGCCCTTC 

451 

MACAAARSPADQDRFICI YPAYLNNKKTI AEGRRI PI SKAVENPTATBIQDVCSAVGLNWLEKNKMYSREWNRDVQYRGRTOV 
QLKQEIX5SIX:LVQFPSRKSWLYMEMIPKLKTRTQKTGGADQSLQQGEGSKKGKGKKKK 

452 

10 CGGAMCTCAGAGCCGGGTTCCTCCCGGGTTTCTGCCGGGTTTCTCCCTGCGGCTC 
GCTTGCGCTGCCGCGCGGTCCCCGGCCGACCAG^ 

GCAGAGGGMGGCGMTCCCCATMGTMGGCTGTTGAAMTCCTACAGCTACAGAGATTC 

CTTMCGTATTTCTTGAGAAAMTAAMTGTACTCTAGAGMTGGMTCGTGATGTCCMTACAGAGG 

CTCAMCAGGMGATGGGAGCCTCTGCCTTGTACAGTTCCCATCACGTAAGTCAGTMTC 
1 5 AMCTAAAMCMGGACACAAAAMCAGGAGGTGCTGACCAMGTCTTCMCMGGAGAGGGMGTAAAAAAGGGAAAGGAAAG 

AAAMGMGTMCCTAGTATCAGCATCMGTATGTGGTACTACTGT^^ 

GAAACAGAAGCTTTTTGTTTGCATCATTTMCTGMCTGT^ 

ACAMTTTACATCAGMGTTTGCATCTCGCGTATA^ 

AATATTTTTAMTGGACAATGGACTGTACMTMGTTACTTGAMTAAGTTC 
20 ACATTATGTCCACCTTTTMGTTMTGAMTAAMTTTGAAACTGAAAAAAAAA 

454 

CAAATTCCCTCTGGCACAGTGGGGATGTGTGGAAAAAMCTGAAGTCAMGACCTCCTGCGTCGTC 
GCMGCTMTCTCTGTAGMTATGGMCAGAGGAAAAMTAAAMTACCAGTMTATCTGTTTATTCAGGTCCA^ 
GTAGGAACATAGAAAGAGTGTCTTTCTACCTAGGATTTTCCATTGMGGCCCTCTGGCATATGATATAGM 
25 ATACATCACCTGTAGTTCAMTATGTTTATTTATATCTTTATGATTTTATTCTCTCTCTCCATTC 
TTATGGCTMCCTCTMTTAGAGATTTTGCTTTTGCCTCCCTGMTGMTTACMGCCTTTT^ 
CGCAGAGCTTGGCAOAMGTGGAGTCAATCTTTTAATGTTTTAM 
ATATCCACTGGTCACATCATAACTGTCTATAGGGCMTAAMTCTGTGT^^ 

455 

30 MSRRYDSRTTIFSPEGRLYQVEYAMEAIGHAGTCLGILANDGVLLMERRNIHKLLDEWFSEKIY 

VLTNELRLIAQRYLLQYQEPIPCEQLVTALCDIKQAYTQFGGKRPFGVSLLYIGWDKHYGFQLYQSDPSGNYGGWKA 

AAAVSMLKQDYKEGEMTLKSAIiAIAIKVLNKTMDVSKLSAEKVEIATLTO 

EKEQKEKDK 

456 

35 CCGTGGACATCTCAGGTCTTCAGGGTCTTCCATCTGGMCTATATAMGTTCAGAAMCATC 

CCACTATATTTTCTCCAGAA^ 

TTTTAGCAMTGATGGTGTTTTGCTTGCAGCAGAGAG 

TTTATAMCTCMTGAGGACATGGCTTGCAGTGTGGCAGGCATMCTTCTGA 

TTGCTCAMGGTATTTATTACAGTATCAGGAGCCMTACCTTGTGAGCAGTTGGT^ 
40 ATACACMTTTGGAGGAAMCGTCCCTTTGGTGTTTCATTGCTGT 

.AGAGTGACCCTAGTGGAMTTACGGGGGATGGMGGCCACATGCATTGGAAATMTAGC 

MGACTATAMGAAGGAGAMTGACCTIGMGTCAGCACTTGCTTTAGCTATCAMGTACT 

MCTCTCTGCTGAAAMGTGGAAATTGCAACACTMCMGAGAGMTGGAMGACAGTM 
• ■ TGGAGCAGTTGATCAAAAAACATGAGGMGMGMGCCAAAGCTGAGCGTGAGM 
45 MTAGMTCAGAGATTTTATTACTCATTTGGGGCACCAOT 

CTTTTTTTMGTGGTGCAGTGGGAAMTAGGACATTACATACTGMTTGG 

TGTMCGATGATGGTTACCCTTCATGGACGTCTTAATCTO 
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457 

MMELSMGPELPTSPLAMETATOFDLLKFDVK^ 

LYWMASNYQQMNPEALNIjTPEDAVEALIGSHPVPQPLQSFDSFRGAHHHHHHHHPHPHHAYPGAGVAH 
ASPPPSSAASPAQQLPTSHPGPGPHATASATMGGNGSVEDRFSDDQLVSMSVREL^ 
. 5 QSCRYKRVQQKHHLENEKTQLIQQVEQLKQEVSRLARERDAYKVKCEKLANSGFREAGSTSDSPSSPEFFL 

458 

CGCTTGGCTCGGCGCGCTCCGGCCGGCCGCAMGTTTCCCGGGCGGCAGCGGCGGCTGCGCCTC^ 

GCTGAGCATGGGGCCAGAGCTGCCCACCAGCCCGCTGGCCATGGAGTATGTCAACGACTTCGACCTGCTC^ 

GAAGGAGCCACTGGGGCGCGCGGAG(^TCCGGGCAGGCCCTGCACACGCCTGCAGCCAGTCGGCTCGGTGTCCTCCA 

10 CAGCACTCCGTGTAGCTCCGTGCCCTCGTCGCCCAGCTTCAGOT 

GGCGAGCMCTACCAGCAGATGMCCCCGAGGCGCTCMCCTGACGCCCGAGGACGCGGTGGMGCGCTCATCGGCTCGCACCC 
AGTGCCACAGCCGCTGCAMGCTTCGACAGCTTTCGCGGCGCTCACCACCACCACCATCACCACCACCCTCACCCGCACCACGC 
GTACCCGGGCGCCGGCGTGGCCCACGACGAGCTGGGCCCGCACGCTCACCCGCACCATCACCATCATCACCMGCGTC 
GCCGTCCAGCGCCGCTAGCCCGGCGCMCAGCTGCCCACTAGCCACCCCGGGCCCGGGCCGCACGCGACGGCCTCGGCGACGGC 

15 GGCGGGCGGCAACGGCAGCGTGGAGGACCGCTTCTCCGACGACCAGCTCGTGTCCATGTCCGTGCGCGAGCTGAACCGCCACCT 
GCGGGGCTTCACCMGGACGAGGTGATCCGCCTGMGCACMGCGGCGGACCCTGMGMCCGGGGCTACGCCCAGTCTTGCAG 
GTATAMCGCGTCCAGCAGMGCACCACCTGGAGMTGAGMGACGCAGCTCATTC 
GTCCCGG(^GCCCGCGAGAGAGACGCCTACMGGTCMGTGCGAGAMCTC^ 
CAGCGACAGCCCCTCCTCTCCCGAGTTCTTTCTGTGAGTCGTGGCCGGTCCTGGCCCCCGCCCTO 

20 GTCCCACGTCCCTAGTCCCAGACTACCCCGGACCCTGTCCCTGCCGCGGOT 

AGGMGGGCGCGCGGGCCGCGGGCGACGGGCGGGTGCGCGGGCGGGCAGGGGACCTTGGCTAAGGCGAGAGTAGCGCACGCCAG 
CGCCGCCTCCTAGACTCGAGCAGAGCCGGAGAGAGAGACGAGAGGGTGGGAGGTCCCGGAGTMCTTCTCTCCAGG 
CGGCGAGGCATACTCCCGATMGTCACCMGGCCATCTGGAGACTCCTGGCTTTCTGMCTTTGCGCGT^ 
GCTTTGCTGCCCGGAGAGTAGTCCGCGCCAGGAAGAGAGCMCGAGGAMGGAGAG&GACTC 

25 CGAGGCTGAGCGAMGAAGGMGGACCAGACGGACCCTGTCTGTCAGAGTTCGGAGAAACACTGGCTCCTCAGCCCTGAGACAC 
AGGCCCTCAGTTAGGGACGCTCGGGGCACGAGGCTCATCAGTTTTATTGCCTGCTCGATTATATAGAAAMTACAAAAAATCTG 
CATTAAAMTATTAATCCTGCATGCTGGACATGTATGGTMTAATTO^ 
TGTTGATCATGGATCATACTCCCCTTGTTTCTTTGGGTC 
TCAGTCCCAGCTGTAGGCTTGTAAATACCCGCCCCGCCAMC^ 

30 TTTATTATTACGGTATTTTTGTTTGTMGTTAAAMGAAAAAAAAAMGAAAMGTTCCGG 
TTGTCTTGGGGCACACTTGGMGTTGCATGTTTTCTTTCCTTCCCTTATCCCCATTCGGTC 

TTTTCMCCTTGTTGGTGCTGAGAGAGAGAACCGAGAGGTCCCAGTACAAGGGCAGGGCAGGGCAGGGAAGCTGCCAAGCTCCG 
CACCCCAGAGGAGTGTTCTGGACTACAGCCTTGTCTTATGGTC^ 
GATCCTCCCCTCTGCTTTTTATTGTMCCAGMTCACCCTGAGGTCCCTTC 
35 CACAGCGCTCCTAGGGTGAGAGGCTTAGCCATCCCTGACCCTGGCAGTGCACTGGTAAGCAGACACTGCACTGMCCMCTGCT 
ATGCTCAGMTGTACCAGAMCCCAMCATTGGCAAGTAATm 

TCTTTCCCTGCTTTTGAGATAGTAGGMGAGTTCTTGGTGGTGTCCCCCCCCTTCMTTCTTC 
AAGATATGGGTGTTTTTCTTTATTATTACTTTTTTTTTTCTGCA 
AAATAAMGTGTATTTTTMGTTCCCATTTGAMTTGC^ 
40 ATTAACAGTAATMCMGATTGTATGMCCGCATGGTGCTTGCAGTTTTAMTATTC 
GCTTTGGTTTTTACAGATTCMCTGTGTTGAMTC 
MTTTCAMCCCTGCTTATCATATGAAAATATTAAMCCT^ 
TTAAMTGGTTTCCTATTTTGCTTATTAAAAAAAAAAAAAAAAAAM 

45 459 

MABVG\fl2KKLLLGPNGPAVAMGDLTSEEEEGQSL^ 
GLEYLYLSVHDEDRDDHTRCNVWIL^^ 

KMRELERKFVKDFQDYMEPEEGCQGSPQRRGPLTSGSDEEWAL^ 
FIQSHLRRFCLQYGAALIYTSVKEEKNLDLLYKYIV^ 
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EDFIVKPPVRKLVHDKELAAEDSQVFLMKQQSLLAKQPATPTRASESPARGPSGSPRTQGRGGPASVPSSSPGTSVKKPDPNIK 
NNAASEGVLASFFT^SLLSKKTGSPGSPGAGGVQSTMKSGQKTVLSWQEELDRMTRKPDSMVTNSSTENEA 

460 

ggcmgatggcgccggtgggggtggagaagmgctgctgctaggtcccmcgggcccgcggt^^ 
5 agtgaggaggaggmggccagac^ctatc&tcctccattctgag^ 
mgmcatcctggtcttcggtgaagatggttc^^ 
ggaagaggcctagmtatctctacctcagtgtccatgatgaggacc 
ggagacttgtaccacamggcctgctgamtttgcagtttctgctgmtc^ 

• atgtctagaccttggactgtgatggmtctctgcagamtgggctagtgttttacgtc^ 
10 ccagaaaamtgagggagctggmcggmgtttgtgaaagattttc^ 

ccacagagmgaggccctctgacctcaggctccgatgmgaamtgttgccctgcctc 

ctggggatcccggtgttggtggtgtgcacamgtgtgatgc 

ttggactttatccagtcacacctgcggaggttctgccttcagtatggagctgccttgat^ 

mcctcgacttgttgtatmgtatattgttcataamc 
15 gccgtttttatacctgcaggctgggacmtgaamgaamtagctattttacatgaam 

gcatatgmgactttattgtgamcctcccgtgaga 

ctmtgmgcmcagtcactccttgccmgcmccagccactcccacgagagcttctgm 

tctccmggacccagggtcggggagggccagccagtgtgcctagctcctccccaggcacgtcagtaaaamgc 

atcaaamtmtgcagcmgtgmggggtgttggccagcttcttc 
20 cctggtgctggtggggtgcagagcacagccmgmgtcaggacaam^ 

• atgactcgaaagccagactctatggtaacaaactc^ 
tgaccamtaactatgtatattgatctgctaagaccaggatttttctgatatggcacat^ 
gatgaactttaaaaaaaaaaaaaaaa 

461 

25 msnktolqkmgkkqngkskkveeaepeefwek . 
ekdgtkrkslsdsesddskskkkrdavdkprgfargldperiigatdss 
eerltwhscpedeaq 

462 

ggagacgctgcagacccgcgacccggagcagctcggaggcggtgmtaatagctcttcmgtctgcmtaaaam 
30 acaaaactacattgcaaaaaatgggaaaaaaacagmtggamgagtaaa 

tggaaaaagtactagatcgacgtgtagtgaatgggaaagtagaatatttcctc 

cttgggmcctgmgaaaatttagattgtccagmttgattgmgcgtttcttmgtct 

gtacaaaaagaamtctttatctgacagtgaatctgatgacagcaaatcamgmgaaaagagatgctc 

gatttgccagaggtcttgatcctgamgmtaattggtgccacagacagcagtgga 
35 attcagatgaggcagacttggtgctggcgaaagaggcamtatgmgot 

taacttggcattcttgtccagmgatgmc^tcmtmto 

TTGGGTCTTAGATTTTGATTTACTAGTGTGACAAAATMCT^ 
AGCGTTGGMGAGTTGTTGGGGTTTTTTTGCA 

TCCTTTATCAGAAMGMCATTTGATACCATGGTATATCATTTCCTCTTCATTAMGM 
40 TGTCCATAGTCATTACTCAGTCAAMCTTGTGTTCTCATGAGOT 

TGTGTGTGTGTGTGTGTGTGTATCCATAAMTGCATATGTAMTTTTT^ 
CAGGTAMCCCATGTTTCTGAGATGCCATTATTC^ 

MCCATACATTTCMGTGAMTMGTAATTCTAGATAGGACMTTTAMTTGGATM 
TTATTTGCAAMTTCCTAAMGGAAAMTTTTATCACTGCCATCACA 
45 GCTAGTGTGTTTTMCTAGCTAMCAAMCTMGTT^ 
TTGGMTCCCTGTTGCTACATTGACTAAMGGTCATGATC 
TTAGAGGTGTTGGCAGTTTAGGACCTGCTGTCATAMTGTGT^ 
TCTTTACCCCMTTCATTACATGGAGGCTCMTCTTGAGTTTGCTTTA 
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TMTCAAMTGTTATCCMCTGTATATTGTTTACTTTATTGTAAATACTGGrc 
AAAAAAAAAAAAAAAAAAAAAAAAAA 

463 

MKASAALLCLLLTAAAFSPQGLAQPVGINTSTTCCYRF^ 
5 DFMKH LDKKTQTPKL 

464 

AGCAGAGGGGCTGAGACCAMCCAGAMCCTCCMTTCTCATGTGGMGCCCATGCCCTCACCCTCCM 
AGCACTTCTGTGTCTGCTGCTCACAGCAGCTGCTTTCAGCCCCCAGGGGCTTGCTCAGCCAGTTGGGATTAATACTTCAACTAC 
CTGCTGCTACAGATTTATCMTMGAAMTCCCTAAGCAGAGGCTGGAGAGCTACAGAAGGACCACCAGTAGCCACTC 
10 GGMGCTGTMTCTTCMGACCAMCTGGACMGGAGATCTGTGCTGACCCCACACAGMGTGGGTCCAG 
CCTGGACMGAAMCCCAMCTCCAMGCTTTGMCATTCATC^ 
TXSTATACCCTGTCCTT^ 

TCATGGTTTTTCTTAGTAGATTTTAAMGTTATTMTATT^ 
• CCTTGGATGTATATGTCATCTCAGTGCTGTAAAMCTGTGGGATGCTC 
15 CTTGCAAGAATCAGTGCAAAGATTTGCTT^^ 

ATTTGCATATGTATGACTCCCAMTTTTCACATAAMTAGATTm 

465 

MQAHELFRYFRMPELVDFRQYVRTLPTNTLMGFG 

FYDDVTTLYEGFQRGIQVSOTGPCLGSRKPDQPYEWLSYKQVAELSECIGSALIQKGFKTAPDQFIGIFAQNRPEWVIIEQGCF 
20 AYSMVIVPLYDTLGNEAimVNKAELSLVF^ 

DLGRMRRKPKPPAPEDLAVICFTSGTTGNPKGAWTO 

HGAKIGFFQGDIRLLMDDLKVLQPTVFPWPRLLNRMFDRIFGQANTTLKRVJLLDFASKRKEAELRSGIIRM 
QSSLGGRVRLIWTGAAPVSATVLTFLRMLGCQFYEGYGQTECT^^ 

EGWCVKGPNWQGYLKDPMTAEALDKDGVJLHTGDIGKWLPNGTLKIIDRKKHIFKLAQGEYIAP 
25 HGESLQAFLIAIWPDVETLCSWAQKRGFEGSFEEIjCRl^DVKKAILEDMVRLGKDSGLKPFEQVKGITLHPELFSIDNGLLTP 
TMKAKRPELRNYFRSQIDDLYSTIKV . 

466 

■ TCMCACAGGACAATGCMGCCCATGAGCTGTTCCGGTATTTTCGMTGCCAGAGCT 
TCTTCCGACCMCACGCTTATGGGCTTCGGAGCTTTTGCAGCACTCACCACCTTCTGGTAC 
30 GCCGCCATGCGACCTCTCCATGCAGTCAGTGGMGTGGCGGGTAGTGGTGGTGCACGMGATCCGCACTACTTGACAGCGACGA 
GCCCTTGGTGTATTTCTATGATGATGTCACMCATTATACGMGGTT^ 
TTTAGGCTCTCGGAAACCAGACCMCCCTATGMTGGCTTTCATATAMCAGGTTGCAG 
ACTGATCCAGMGGGCTTCMGACTGCCCCAGATCAGTTCATTC 

ACAAGGATGCTTTGCTTATTCGATGGTGATCGTTCCACTTTATGATACCCTTGGAMTGAAGC^ 
35 AGCTGMCTCTCTCTGGTTTTTGTTGACM 

CCTTAAAATCATAGTTGTCATGGATGCCTACGGCAGTGMCTGGTC 

GMGGCGATGGAGGACCTGGGMGAGCCMCAGACGGMGCCCMGCCTCCAGCACCTGMGATCTTGCAGTAATTO 
MGTGGMCTACAGGCAACCCCAMGGAGCMTGGTCACTCACCGAMCATAGTGAGCGATTGTTCAGCm 
AGAGMTACAGTCAATCCTTGCCCAGATGATACTTTGATATCTTTCTTGCCTC 
40 TGTAATGCTGTGTCATGGAGCTAAMTCGGATTTTTCCM 

CACTGTCTTCCCCGTGGTTCCMGACTGCTGMCCGGATGTTTGACCGMTTTTCGGACM 
GCTCTTGGACTTTGCCTCCMGAGGAMGMGCAGAGCTTCGCAG 

CTTCCACAMGTACAGTCGAGCCTGGGCGGMGAGTCCGGCTGATGGTGACAGGAGCCGCCCCGGTC 
GTTCCTCAGAGCAGCCCTGGGCTGTCAGTTTTATGMGGA 
45 TGGAGACTGGACCGCAGGCCATGTTGGGGCCCCGATGCCGTGCMT 

GGCTGCCGAGGGCGAGGGCGAGGTGTGTGTGAMGGGCCAMTGTATTTCAGG 

AGCTTTGGACAMGACGGCTGGTTACACACAGGGGACATTGGAAAATGGTTACCAM 

AMGCACATATTTMGCTGGCACMGGAGMTACATAGCCCCTGAAMGATTGAAMTATCTACATGCGM 
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TCAGGTGTTTGTCCACGGAGAMGCCTGCA^ 

CCAAMGAGAGGATTTGAAGGGTCGTTTGAGGMCTCTGCAGAMTAAGGA 

ACTTGGGAAGGATTCTGGTCTGAAACCATTTGMCAGGTCAAJ^GGCAT^ 

CCTTCTGACTCCMCAATGMGGCGAAMGGCCAGAGCTGCGGMCTATTTCAGGTCGC 
5 CAAGGTTTAGTGTGAAGMGAMGCTCAGAGGAMTGGCACAGTTCCACMTCTCTOTCCTC 

ATTTTGMTACAGCMGTGTAGGGMGGMGCGTTCGTGTTTGACTTGTCCATTCGGGG 

MCAGMCACCGCCTTACAGTCACCTCATGTTGCAGACCATGT™ 

TAMGGGGATACTATAMTGTGCTMGTTATTTGAGACTTCCTCAGTTTAAAMGTC 

TTCTAATCAAGGGGTTAGGACTTTGCTATCTC^ 
10 GTACAGTGCACTAGAGGGMGTGTTCCCTTTAAAAATMGMCAACTGTC 

TTTTTAMTCCCTGCTACTGTCCCTO^ 

TCCAGTCCCCTGTGGTTTCTGTTGGAGCATMGGm 

GCCTCTGGCAAATGGGTGCTCATCAGAACGCGTGGATTCTCTTTC 
• TCCCCGACTCCATCCTTm 
15 CCTGGATCAGCCATGCTCAGCTGTGACGCCTGMTMCTGTCTACTTTATCTTCACTGM 

CAGATTTTTMTCTGGTTTTC^ 

CATTTCTTTAMGCTATTMGGGAGTATATACT^ 

CI<3ATTTCTATGAAATGTGTTTGACMG 

AGGGATTTTTTAAAGTTMTTTGGGAMTTMCAGCAGTTCACTTC 
20 CATTTGTACATTTAMGCAGCTGTTTTGGGGTC 

TTGTCATTGTAATMCACTACTTGGTAGCCTAACTTCAT^ 
GGGTTTTGAATGTTTCCTT^MGTGTTGGC 
AAAATAAATATTTC TGC ATTTCG 

467 

25 MADEELEALRRQRMELQMHGDPGDMQQEAKHREAEMRNSILAQVLDQSARARLSNLALVKPEKT^^ 
EKVSEQGLIEILKKVSQQTEKTTTVKFNRRKVMDSDEDDDY 

468 

CTGCTCCAGCGCTGACGCCGAGCCATGGCGGACGAGGAGCTTGAGGCGCTGAGGAGAC 

CACGGGGATCCTGGTGATGCGGCCCMCAGGMGCAMGCACAGGGMGCAGAAATGAGAMCAGTATCTTAGCCCMGTTCTG 

30 gatcagtcggcccgggccaggttaagtmcttagcacttgtaaagcctgaaaaaactaaagcagtagagm™ 
atggcmgatatggacmctmgtgagmggtatcagmcmggtttaatagaaatccttaaaamgtmgccm 
aagacmcmcagtgaaattcaacagaagaamgtmtggactctgat^ 

CTAGMCTTAACGGMCMGTCTAGGACAGMGTTMGATCTGATTATTTACTTTGTT^ 
TAAACTTGTTATGCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

35 469 

MQVSTMLAVLLCTMALCNQVLSAPLMOT 
YVSDLELSA 

470 

CGCAMGAGTAGTCAGTCCCTTCTTGGCTCTGCTGACACTCGAGCCCACATTCCATCACCTGCT 
40 CTGCTGCCCTTGCCGTCCTCCTCTGCACCATGGCTCTCTGCMCCAGGTCCTCTCTGCACCACGTGAGTC 

GGGTATCACCACTCTCTGGCCATGGm 

CTTAMGCGCTGCCAMC^CCTTGGTCTTTTTCTTCA 

CAGTTGCTGCTGACACGCCGACCGCCTGCTGCTTCAGCTACACCTC^ 

AGACGAGCAGCCAGTGCTCCMGCCCAGTGTCATOTCCT^ 
45 GGGTCCAGAMTACGTCAGTGACCTGGAGCTGAGT^CCTGAGGGGTCCAGMGCTTCGAGGCCCAGCGACCTCA 

TGGGGAGGAGGAGGAGCCTGAGCCTTGGGMCATGCGTGTG 

GCCAC^CTCTGGGACTCTTCTTMCTTAMTmAATTTATOT 

TGTTTGTGATTGTTTGCTCTG^ 
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ATCGGCCTGTGTAGGCAGTGATGGCACCAMGCCACCAGACTGACAAATGTGTGA^ 
GCCTGGGGAAATAATAAAGATGTTCTTTTAAACGGT 

471 

MELRPWLLWWMTGTLVLLMDAQGQK^ 
5 SRLQREPQVQWLEQQVAKRRTKRDVYQEPTDPKFPQQWYLSGVTQRDLNVKAAWAQGYTGH 

PGASFDVNDQDPDPQPRYTQMDNRHGTRCAGEVMVAMGVCGVGVAYNARIGGVRMLDGEVTDAVEM 
SVJGPEDDGKTVDGPARLAEEAFFRGVSQGRGGLGS I FWASGNGGREHDSCNCDGYTNSI YTLS ISSATQFGNVPWYSEACSST 
LATTYSSGNQNEKQIVTTDLRQKCTESHTGTSAS 

SHSYGYGLLDAGAMVALAQNWTTVAPQRKCIIDILTEPKDIGKRLEVRKTVTACLGEP 
10 LVSPMGTRSTLLAARPHDYSAIXSFNDWAFMTTHSra^ 

TLTSSQACWCEEGFSLHQKSCVQHCPPGFAPQVLDTHYSTENDVETIRASVCAPCHASCATCQGPALTXLSCPSHASLDPVE 

QTCSRQSQSSRESPPQQQPPRLPPEVEAGQRLRAGLLPSHLPEWAGLSCAFIVLVFVTWLVLQLRSGFSFRGVKW 

ISYKGLPPEAWQEECPSDSEEDEGRGERTAFIKDQSAL 

472 

15 GCGGGGMGCAGCAGCGGCCAGGATGMTCCCAGGTGCTCTGGAG 

CTGCCCGTCTCGGCCCCATGCCCCCACCAGTCAGCCCCGGGCCACAGGCAGTGAGCAGGCACCTGGGAGCCGAGGCCCTATGAC 
CAGGCCMGGAGACGGGCGCTCCAGGGTCCCAGCCACCTGTCCCCCCCATGGAGCTGAGGCCCTGGT^ 
GCMCAGGAACCTTGGTCCTGCTAGCAGCTGATGCTCAGGGCCAGMGGTCTTCACCMCACGTGGGCTGTGCGCATC 
GGCCCAGCGGTGGCCAACAGTGTGGCACGGMGCATGGGTTCCTCAACCTGGGCCAGATCTTCGGGGACTATTACCA 

20 CATCGAGGAGTGACGAAGCGGTCCCTGTCGCCTCACCGCCCGCGGCACAGCCGGCTGCAGAGGGAGCCTCAAGTACAGTGGCTG 
GMCAGCAGGTGGCAMGCGACGGACTAMCGGGACGTGTACCAGGAGCCCACAGACCCCAAGTTTCCTCAGCAGTGGTACCTG 
TCTtMTGTCACTCAGCGGGACCTGAATC^ 

GACGATGGCATCGAGAAGMCCACCCGGACTTGGCAGGCMTTATGATCCTGGGGCCAGTTTTGATGTCMTC 
GACCCCCAGCCTCGGTACACACAGATGAATGACAACAGGCACGGCACACGGTGTGCGGGGGAAGTGGCTGCGGTGGCCAACAAC 
25 GGTGTCTGTGGTGTAGGTGTGGCCTACMCGCCCGCATTGGAGGGGTGCGCATGCTGGATGGCGAGGTGA 

GCACGCTCGCTGGGCCTGAACCCCAACCACATCCACATCTACAGTGCCAGCTGGGGCCCCGAGGATGACGGCAAGACAGTGGAT 

GGGCCAGCCCGCCTCGCCGAGGAGGCCTTCTTCCGTGGGGTTAGCCAGGGCCGAGGGGGGCTGGGCTCCATCTTTC 

TCGGGGMCGGGGGCCGGGAACATGACAGCTGCAACTGCGACGGCTACACCMCAGTATCTACAC 

ACGCAGTTTGGCAACGTGCCGTGGTACAGCGAGGCCTGCTCGTCCACACTGGCCACGACCTACAGCAGTGGGAACCAGAATGAG 
30 MGCAGATCGTGACGACTGACTTGCGGCAGAAGTGCACGGAGTCTCACACGGGCACCTCAGCCTCTGCCCCCTTAGCAGCCGGC 
ATCATTGCTCTCACCCTGGAGGCCMTMGMCCTCACATGGCGGGACATGCMCACCTGGTGGTACAGACCTC 
CACCTCMTGCCMCGACTGGGCCACCMTGGTGTGGGCCGGAMGT 

GCCATGGTGGCCCTGGCCCAGMTTGGACCAGAGTGGCCCCCCAGCGGAAGTGCATCATCGACATCCTCACCGAGCCCAAAGAC 
ATCGGGAMCGGCTCGAGGTGCGGAAGACCGTGACCGCGTGCCTGGGCGAGCCCMCCACATCACTCGGCTGGAGCACGCTCA 
35 GCGCGGCTCACCCTGTCCTATMTCGCCGTGGCGACCTGGCCATCCACCTGGTC^ 

GCAGCCAGGCCACATGACTACTCCGCAGATGGGTTTMTGACTGGGCCTTCATGACMCTCATTCCTGGGATGAGGATCCCTCT 

GGCGAGTGGGTCCTAGAGATTGAAMCACCAGCGMGCCMCMCTATGGGACGCTGACCMGTTCACCCTCGTA^ 

ACCGCCCCTGAGGGGCTGCCCGTACCTCCAGAMGCAGTGGCTGCMGACC^^ 

GMGGCTTCTCCCTGCACCAGMGAGCTGTGTCCAGCACTGCCCTCCAGGCTTCGCCCCCCAAGTCCTCGATACGCACTATAGC 
40 ACCGAGAATGACGTGGAGACCATCCGGGCCAGCGTCTGCGCCCCCTGCCACGCCTCATGTGCCACATGCCAGGGGCCGGCCCTG 
ACAGACTGCCTCAGCTGCCCCAGCCACGCCTCCTTGGACCCTGTGGAGCAGACTTGCTCCCGGC^ 
TCCCCGCCACAGCAGCAGCCACCTCGGCTGCCCCCGGAGGTGGAGGCGGGGCMCGGCTGCGGGCAGGGCTGCTGCCCT^ 
CTGCCTGAGGTGGTGGCCGGCCTCAGCTGCGCCTTCATCGTGCTGGTCTTGGTCACTGTCTTCCTGGTCC 
GGCTTTAGTTTTCGGGGGGTGMGGTGTACACCATGGACCGTGGCCTCATCTCCTACMGGGGCTGCCCCC 
45 GAGGAGTGCCCGTCTGACTCAGMGAGGACGAGGGCCGGGGCGAGAGGACCGCCTTO 

GCCCACTGCCCACCCCCTCMGCCMTCCCCTCCTTGGGCACTTTTTMTTCACCAMGTATTTTT^ 
TGGACCCCAGCTGGGAGGCAAGAGGGGTGGAGACTGTTTCCCATCCTACCCTCGGGCCCACCTGGCCA 
GACCAGCTGGGGCGTGGGGAGGGCCGTACCCCACCCTCAGCACCCCTTCCATGTGGAGAMGGAGTGAM 
TGCCCCGGCCCCGGCCCCAGCCAGAGTTCCTGCGGAGTGMGAGGGGCAGCCCTTGCTTGTTO 
50 GCTCTTGCCCTTCCCTGTCCCTCTAMGCAATMTGGTCCCATCCAGGCAGTCGGGGGCTGGCC 



WO 02/46465 



PCT/G BO 1/05458 



424 

AGGCCACCTCTCCAAGGGCTTCTGCACCCTCCACCCTGTCCCCCAGCTCTGGTGAGTCTTC 
GGACCMGGCMGGCAGGTGCCTCCAGGTGTGCACGTGGCATC 
TGCCTCCACCACCACTGGCCACCAGGCTGGCGCAGCCMGGCCGMGCTCTGGCTGM 
CCCCTCTCTTGCACCCGCCTCTCCCGTCAGGGCCCMGTCCCTGTTTTCTGAGCCCGGGOT 
5 ACCTGGAGCCCCTGGGTGGGTGGTGGGGAGGGGCGCTGGCCCAGCCGGCCTCTCTGGCCTCCCACCCGATC 
TGGGGATCTCAQGGGCTGTTTGAGGATATATTTTCACTTTG 

TAATGGGGGTAGCAGCTGGACTACCCACGTTCTCACACCCACCGTCCGCCCTGCTCCTO 
GGGCMCTGCAGCATGTTGCTGAGGAGTGAGGAATAGTTGAGCCCCAAGTC 
' MGGGGGTCCCAGTGGGAGGGGCAGGCTGACATCTGTGTTTCMGTGGGGCTCGCCATGC 
10 TCCMGTGCCAGAGGTG<KCAGGTGGTGGCACTGAGCCCCCCCMCACTGTGC^ 

CCCCTCAAACCTCCTCTTCTGACGTGCCTTTTGCACCCCTCCCATTAGGACMTCAGTCC 

CTTTTCTACCCTAGCCATTCCTGGTACCCAGCCATCTGCCCAGGGGTGCCCCCTCCTCTCCCATCCCC^ 

CCCGGCTGGTTTTGTAAGATACTGGGTTGGTGCACAGTGATTTTT^ 

TTAATGGACATGAGATMTGTTAGAGGTTTTAMGTGATTAAACGTGCAGACTATGCAAACCAG 
15 473 

MAMESTATAAVMDWSADKIEDVPAPSTSADKVESLDTO^ 
GEAFFFYGKSLLELARMENGVLGMLEGVHVEEEEGEKTEDESLV^^ 

ETDKEQDSEMEKGGREDMDISKSAEEPQEKVDLTLDWLTETSEEAKGGAAPEGPNEAEVTSGKPEQEVPDAEEEKSVSGTDVQE 
ECREKGGQEKQGEVIVSIEEKPKEVSEEQPWTLEKQGTAVEVEAESLDPTVKPVDVGGDEPEEKVVTSENEA 
20 EVPPAEESPEVQTEMEASAVEAGSEVSEKPGQEAPVLPKDGAVNGPSWGDQTPIEPQTSIERLTETKDGSGLEEKVRAKLVP 
SQEETKLSVEESEMGDGVDTKVAQGATEKSPEDKVQIAANEETQEREEQMKEGEETEGSEEDDKENDKTEEMPNDSVLEMSL 
QENEEEEIGNLELAWDMLDLAKIIFKRQETKEAQL^^ 

QLGLAYGYNSQYDEAVAQFSKSIEVIENRI^VLNEQWEAEGSSEYKKEIEELKELLPEIREKIEDAKESQRSGNVAELALKAT 
LVESSTSGFTPGGGGSSVSMIASRKPTDGASSSNCVTDISHLVRKKRKPEEESPRKDDAKKAKQEPEVNGGSGDAVPSGNEVSE 
25 NMEEEAENQLKRGAAVEGTLEAGATVESTAC 

474 

gcctgagtgagtctctggcgtcccamttgcctgtttto 

gatggccatggagtccacagccactgccgccgtcgccgcggacgtggtttctgccga^ 

tacatctgcagataaagtggagagtctggatgtggatagtgmgctmgamctattgggtto 

30 gggggatattccagcagctgtcmtgcattccaggmckagc 
tggagaagccttctttttctatgggamtcacttctggagttc 
tgtgcatgtggaagaggmgmggagaaaamcagmgatgmtctctgg^ 
ggmgagttgagagmcaggtttatgacgccatgggagaaamg 
tgamctgataaagmcaggacagtgaaatggagmgggtggmgagaagatatggat^^ 

35 ggaaaaagttgacttgactctagattggttmctc 
. agctgaggtcacttctgggmgccagmcaggmgtaccagatgctgaggmgaaamtcagtttctggm 
agagtgcagagaaamggaggtcaggagaagcagggagaggtmttgtgagcataga^ 
gcagcctgtggtgactctagaamgcagggcactgcagtggaggtagmgcagagtctto 
tgtgggtggggacgagccagaggagaaggtagttacctctgaamcgaggcaggamggcggttct^ 

40 agmgtaccacctgctgmgagtcaccagaggtgcamcagaggctgcagaggcctcagctgtagaggctc 
tgaaaagcctgggcaggaggctccagttctccctaaggatggtgcagtcmtggaccgtcagttc 
tgaaccacagacttctatagaaagactgacagaaacaaaagatggctcaggactagaggagaaggtcagggcaaagctggttcc 
tagtcaggaggagactmgctgtctgtagaagagtctgaggcagctggagatggggttga 
tgagamtcacctgmgacamgttcagatagctgctmtgmgagacacmgagagagaagm 

45 mctgmggctcggmgaggatgatamgaaaatgatmgactgmgamto 
tcmgaamtgaggaggaggagattgggmcctagagcttgcctgggata^ 
agaaacaamgaagcacagctttatgctgcccaggcacatcttaaactcggagaagttag^ 
agctgtggaggagttccagtcctgccttmcctgcaggmcagtacctggaagcccacgaccgtct^ 
ccagctgggcttggcttatgggtacaactctcagtatgatgaggcagtggca 

50 CAGAATGGCTGTACTAMCGAGCAGGTGAAGGAGGCTGMGGATCGT^ 
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GCTACCCGAMTTAGAGAGAAGATAGMGATGCAMGGAGTCTCAGCGTAGTGGGMTGTAGCTC 

TCTGGTGGAGAGTTCTACTTCAGGTTTCACTCCTGGTGGAGGAGG^ 

CGGTGCTTCCTCATCAMTTGTGTGACTGATATTO 

AGATGATGCAMGAMGCCAMCAAGAGCCGGAGGTGMCGGAGGCAGTGGGGATGCTGTCCCGAGTGG 
5 AMCATGGAGGAGGAGGCTGAGMTCAGCTGAAACGCGGAGCAGCAGTGGAGGGGACACTGGAGGCTGGAGCT^ 
CACTGCATGTTMGAGGGGGCACAGCCTCCTCCCM 
TTTTTGTATAACTTCMTAMGATTGTMGCAAAAAAAAM 

475 

MEHTGHYLHLAFLMTTWSLSPGTKANYTRLWANSTSSWDSVIQNKTGRNQNEN 
10 KSEQELYIPSWSNSPSWQSIEOTSKSHGEIFKKDVCAENNNNMAM 
GKRQPRSNGDFLASGLWPAESDTWKRTKQLTGP^^ 

476 

AGTTGCAGTGGAMGAMTGTGTCATCTGTGGTTTGGTTTTT 
TTTACTTTMGGCTCATATTCTCCMGTCTATTCTGCTTO 
15 AATCAGATTTTGACCMGCATTTTGTMGATTGCCMGTATGC 
TTTCTGATGACMCAGTTTTTTCTTTGTCTCCTGGMCAAM 

GATTCAGTTATTCAAMCMGACAGGCAGAMCCAAMTGAAAACATTAACACAAACCCTATMCTCCTGMGTA 
GGTMTTCTACAMCATGCCTGAMCATCTCACATCGTAGCTTTMCT^ 

GTCAGCMCAGTCCTTCMCAGTACAGAGCATTGAAMCACMGCAAMGTCATGGTGAMTTTTCAAAAAGGATGTC 
20 GAAMCMCMCMCATGGCTATGCTMTTTGCTTM 

ACTGTGGTTTTGGCAMCAMGTCTCTTCTCTCAGACGATCAAMCMGTAGGCMGCGTCAGCCTAGMGCM 
CTGGCAAGCGGTCTATGGCCCGCT^ 

ACTGGAGTGCTCACAGCTACMGGGAAAGAAMGATGMGAAGGMCTGAAA 
AAMTGCAMGTAGCMTGAGMGGCmTGGAGTAAAMTGAAGTCAGTTGG 
25 TAAMTTTGACATGGTAGACCTTGCMTTTAGMTCMGCAGGTC^ 
GTACTTTTGTTTTGACACTGMTATTTT^^ 

GGAMCTGATTMTAGAGATAGCAAGGGATMTTAMTAMTATTCCCTATGTAGCAACAGTGGTTAGATGATCT^ 
TGTMTAAMCTTTGAATAGTTTTAGTGTGTCC 
ATAGATAGAGAGAGCTAAACTGTGTMTTTMTGGTA^ 
30 ACAGCTAATTmGAATAGGTCCTTTATCTTTCCATO 

CCTTTACTTTTGTATAATAAATTAAAACTCAGAATAAACCTGTGACCACGT 

477 

MENGDNEKMMLEMICHQIEYYFGDFNLPRDKFLKEQIKLDEGWVPLEIMIKFNRLNI^ 
DKTKIRRSPSKPLPEVTDEYKNDVmS^^ 
35 ETPGQKYKETDLLILFKDDYFAIGtNEERKQNKVEAK^^ 

FS^GEIKWIDFVRGMEGIILFKEKAKEALGKMDANNGNLQLRNKEVTWEVLEGEV 
KGKGKGNKAAQPGSGKGKVQFQGKKTKFASDDEHDEHDENGATGPVKRAREETDKEEPASKQQKTENGAGDQ 

478 

ggagtcgttgttgttgctgtttgtgagcctgtgcggcggcttctgtgggccggm 
40 ggtgataatgaamgatggctgccctggaggccaaaatc^ 

gacaagtttctamggmcagataamctggatgaaggctgggtacctttggaga 

ctmcmcagactttaatgtmttgtggmgcattgagcamtc 

atcagmggtctccmgcaaacccctacctgaagtgactgatgagtataaamtgatgtaa 

ggcttcccaactgatgcmctcttgatgacataamgmtgg™ 
45 acattgcatamgcatttaagggatcaatttttgttgtgtttgatagcattc 

cagaagtacaaagamcagacctgctaatac^ 

gtggmgctamttaagagctamcaggagcaagmgcaamcaamgttagmgmgatgctc 
mgattggatgcttgctgamttttcgggtgatttagatc^^ 
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GGTGAMTAAMTGGATAGACTTCGTCAGAGGAG 

AMGCCAMGATGCAMTMTGGTMCCTACAATTAAGGMCAMGMGTC^ 

GMGCACTGMGAAAATAATAGAAGACCMCMGMTCCCTAAACAMTGGMGTCAAMGGTCGTAGATTTAMGGAAM 
MGGGTMTAMGCTGCCCAGCCTGGGTCTGGTAMGGAAMGTA^ 
5 GAACATGATGAACATGATGAAMTGGTGCMCTGGACCTGTGAAM^^ 

AMCMCAGAAMCAGAAMTGGTGCTGGAGACCAGTAGTTTAGTAMCCM 
CTTTTGTTTGCGGGGCTTTTAAMGGAAMCCG 

GTTTMCTTGTCTTTTTGTTATGCAAATGAGATTTCTTTGMTGTATTGTTC 
AMGGMGATTCTTCCATTAMTTGCCTTTGTAATATGAGMTGTATTAGTACAMCTM 
10 AAGAGCAAAAAAAAAAAAAAAAA 

479 

MLKLTPLPSKMKVSMLLCLLLMAATFSPQG^ 
CADPKERWVRDSMKHLDQIFQNLKP 

480 

15 ATGCTGMGCTCACACCCTTGCCCTCCMGATGMGGTTTCTGCAGCGCTTCTGTGCCTGCTGCT 
CCTCAGGGACTTGCTCAGCCAGATTCAGTTTCCATTCCMTCACCTGCTGCTTTMCGTGATCM 

AGGCTGGAGAGCTACACMGMTCACCMCATCCAATGTCCCAAGGAAGCTGTGATCTTCAAGACCCAACGGGGCAAGGAGGTC 
TGTGCTGACCCCMGGAGAGATGGGTCAGGGATTCCATGMGCATCTGGACCAMTATTTCAAM 

481 

20 MARATLSAAPSNPRLLRVALLLLLLVMSRRAAGAPLATELRCQCLQT 
CLNPASPMVKKIIEKMLKNGKSN 

482 

gacagagcccgggccacggagctccttgccagctctcctcctcgcacagccgctcgaaccgcctgctgagccccatggcccgcg 

ccacgctctccgccgcccccagcmtccccgk3ctcctgcgggtggcgctgctgctcctgctcctggtggccgcca 
25 cagcaggagcgcccctggccactgmctgcgctgccagtgcttgcagaccctgcagggm 

tgmggtgmgtcccccggaccccactgcgcccamccgmgtcatagccacactcmgmtgggcagaaagcttgtc 

ccgcatcgcccatggttmgaaaatcatcgaaaagatgctgaaaaatggcaaatccaactgaccagaaggaaggaggaagctta 

ttggtggctgttcctgmggaggccctgccttacaggmcagaagaggaaagagaga 

gcctaatgtgtttgagcatacttaggagmgtctot 
30 tmtattttatgtgtaamtmggttatgattgmtctacttc 

ccmgttagttcmtcctgattcatatttmt 

gtgatgacatatcacatgtcagccactgtgatagaggctgaggmtccaagaamtggccagtmgatcm 
amtgtatgtgtgtctattttgtmctgtamgatgmtc^ 
tttctcatgttgmgctttmgmctaam 
35 taatgttmttatgcagtgtttccctctgtc^ 
taaaatatttaaaaatat 

483 

MKLVRFLMKLSHETVTIELKNGTQVHGTITGVDVSMNTHLKAVKMTLKNREPVQLETLS 
PKVKSKKREAVAGRGRGRGRGRGRGRGRGRGGPRR 

40 484 

GMTTCCCCCCCCCCCCCCAGTGCTCCGCGCGCTCTTGACG 

GTCGGTCAGTGTTCGGTTGMGGATTCTGTGTGCTGTCGGACCCAGAGGGTGACGGCGCCGCTAGGATC 
TTGATGAAATTGAGTCATGAMCTGTAACCATTC^ 

AGCATGAATACACATCTTAMGCTGTGAAMTGACCCTGMGMCAGAGMCCTGTACAGCT^ 
45 AATAACATTCGGTATTTTATTCTACCAGACAGmACCTCTGGATACACTACTTC 

AAMGGGMGCTGTTGCAGGAAGAGGCAGAGGMGAGGMGAGGAAGAGGACGTGGCCGTGGCAGAGGMGAGGGGGTC 
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CGATAATGTCTCTCMGATTTCMAGTC^^ 
TAMCATAMTGTGGGACAGAGCTGTCTATTTAG 
AGAGCTMGCATTTCTACTGGGCAGTTTCATTTTTAG 
ATGATGMTMGGTTAAMTAAAAGCCTTAGACAMTTAMTTTC^ 
5 TAGCCCCTGAACCATAATAGGCTCAGAGGCTTCAGCCCAGCTGCATAGTTGMGATTTATGGACAGM 
AAMTGGMGTGAGATACAGCMCAGCCGGATTAGTTACAGTTCAGCGTTTGCCTTAT^ 
TCTTTGGTTGGCTGAAACTTAGTGATTGCCAC 

CTGAGAAACCTTTMGCTGAACTTGAGATATGTAMGAGACTTTAGGCTAAACTTA^ 
ACTTCACATGCACTGAATATGCATTTTATTGC 
10 TGGTATTTCGGAMCTAMGTCCTTATGGGATGGGGTCTAGM^ 

TACCTGGTAAAMGTTGTCTTCTAMTTMGGGTCATTGCTTTGTTC 
TTTMTCAGTTCTTMTAGGATATAGTTTTATC 

CAGTGTTAAMTCTCCTATTMTGTGTMTGTACCTGTCAGTGCCTCCTTTATTM 
ACCTACTTTATTTGACAGCAAAAAAAAAAAGGAATTC 

15 485 

MARAALSAAPSNPRLLRVALLLLLLVM^ 
CLNPASPIVKKIIEKMLNSDKSN 

486 

cacagagcccgggccgcaggcacctcctcgccagctcttccgctcctctcacagccgccagacccgcctgctgagcccc 
20 ccgcgctgctctctccgccgcccccagcmtccccc^ctcctgcgagtggcactgctgctc 
gcgcgcagcaggagcgtccgtggccactgmctgcgctgccagt^ 
mgtgtgmcgtgmgtcccccggaccccactgcgcccamcto^ 

cmtcctgcatcccccatagttmgaaaatcatcgaamgatgctgmcagtgacamtccmctgaccagmgggaggaggm 
gctcactggtggctgttcctgaaggaggccctgcccttataggmcagaagaggamgagagacacagctc 

25 gattgtgcctmtgtgtttgagcatcgcttaggagmgtcttctatttatttatttattcattag 
atattttaggtgtaaaatmttaagggtatgattmctctacctgcacactgtcctattatattc 
ccccaagttagttcmtctggattcatatttmtttgmggtagmtgttttcam 
tgaggagcctgcmcatgccagccactgtgatagaggctggcggatccmgcamtggccmtgagatcattgtgm 
• gmtgtatgtgcacatctgttttgtaac^ 

30 catttctcatgttgaaactttaagmctaamtgttctamtatccc 
mtttmtggtagttttacagtgtttctggcttagaacam 

AGCACTTATAG 
487 

MIFPWKCQSTQRDLWNIFKLWGWTM^ 
35 SYYWIGIRKIGGIVm^GTNKSLTEEAEMGIXSEPNNKKNKEDCVBIYIK^ 
GECVEIINNYTCNCDVGYYGPQCQFVIQCEPLEAPEI£ 

TCQVIQCEPLSAPDLGIMNCSHPLASFSFTSACTFICSEGTELIGKKKTICESSGIWSNPSPICQKLDKSFSMIKEGDYNPLFI 
PVAVMVTAFSGLAFIIWLARRLKKGKKSKRSMNDPY 

488 

40 CTCCCTTTGGGCAAGGACCTGAGACCCTTGTGCTMGTCMGAGGCTC 

AGCCATGATATTTCCATGGAMTGTCAGAGCACCCAGAGGGACTTAT 

TTGTGATTTCCTGGCACATCATGGMCCGACTGCTGGACTTACCATTATTC 

ATTCTGCCGAGACMTTACACAGATTTAGTTGCCATACAAAACMGGCGGAMTTC 

TCGTTCTTACTACTGGATAGGMTCCGGMGATAGGAGGMTATGGACGTGGGTGGGM 
45 AGAGMCTGGGGAGATGGTGAGCCCMCMCMGMGMCAAGGAGGACTGCGTGGAGATC^ 

AGGCAMTGGMCGATGACGCCTGCCACAMCTAAAGGCAGCCCTCTGTTACACAGCTTCTTGC^ 

CCATGGAGMTGTGTAGAMTCATCAATMmCACCTCCAACTGTGATC 

TCAGTGTGAGCCTTTGGAGGCCCCAGAGCTGGGTACC^ 
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TGCCTTCAGCTGCTCTGMGGMCAMCTTMCTGGG 

ACCAACCTGTCMGTGATTCAGTGTGAGCCTCTATCAGCACCAGATTTGGGGATO 
CAGCTTTACCTCTGCATGTACCTTCATCTGC^ 
MTCTGGTCAMTCCTAGTCCMTATGTCAAAMTTGGACAAMGTTT^ 
5 CATTCCAGTGGCAGTCATGGTTACTGCATTCTCTGGGTTGGCATTTAra 
ATCCMGAGMGTATGMTGACCCATATTAMTCGCCCTTG 
AMTCCTTCCATGAAACGTTTTGTGTGGTGGCACCTCCTACGTCAMCATC 

tctacctgaccaacagttccttcagcttccatttcgcccctcatttatccctcmccccc 
agctttttgtcttttctgaggagamcamtaaga 
1 0 ctttcttgactcttgttt^ 

tgtgmtatggactcagttttcttcx:agatcamtttcacgtcgtc 
aaaaagtctacgctctcctttctttctmctccagtgaagtm^ 
gcctcgccgtctgtgmttggaccatcctatttaactggcttcagcctc 
ctgacttccacacctagcatctcatgagtgccmgcaamggagagaagagagamtagcctgcgctc 

15 TTTTGCTGlTrcCTTTTATGAGACCCATTCC^^ 

CACTGAMTGCTAGCTGCMGTGACATCTCTTTGATC^ 
GAMTGCTCTCCTTTCCCCTGCCCCCAGACCTTTTATCCACTTACCTAGATTC 
CCTCAACCCCACCACTTCTTTTATMCTAGTCCTTTACTMT 
GTTACCCTGTAACATGCAATTTTGCATTTGAATAAAGCCTGCTTTTTAAGTGTTAA 
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CLAIMS 

A method for the ideniification of a gene that is implicated in a specific disease or physiological 
condition, said method comprising the steps of: 

a) comparing: 

i) the transcriptome or proteome of a first specialised cell type that is implicated in the 
disease or condition under first and second experimental conditions; with 

ii) the transcriptome or proteome of a second specialised cell type under said first and 
said second experimental conditions; and 

b) identifying as a gene implicated in the disease or physiological condition, a gene that is 
differentially regulated in the two specialised cell types under the first and second 
experimental conditions. 

2. A method according to claim 1, wherein said specialised cell types, are selected from the group 
consisting of cardiomyocytes, endothelial cells, sensory neurons, motor neurons, CNS neurons, 

15 astrocytes, glial cells, Schwann cells, mast cells, eosinophils, smooth muscle cells, skeletal muscle 
cells, pericytes, lymphocytes, tumor cells, monocytes, macrophages, foamy macrophages, 
granulocytes, synovial cells / synovial fibroblasts and epithelial cells. 

3. A method according to claim 1 or claim 2, wherein said first and second experimental conditions differ 
in respect of the cellular microenvironment, or in respect of exposure to hormones, growth factors, 

20 cytokines, chemokines, inflammatory agents, toxins, metabolites, pH, pharmaceutical agents, hypoxia, 
anoxia, ischemia, imbalance of any plasma-borne nutrient, osmotic stress, temperature, mechanical 
stress, irradiation, cell-extracellular matrix interactions, cell-cell interactions, accumulations of foreign 
or pathological extracellular components, intracellular and extracellular pathogens, or a genetic 
perturbation. 

25 4. A method according to any one of the preceding claims, wherein the first experimental conditions and 
second experimental conditions differ in that under the second experimental conditions, the cells are 
exposed to a physiological stimulus. 

5. A method according to claim 4, wherein the physiological stimulus is a physiological, mechanical, 
temperature, chemical, toxic or pharmaceutical stress. 

30 6. A method according to claim 5, wherein said physiological stress is hypoxia. 
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7. A method according lo any one of the preceding claims, wherein said first and second 
experimental conditions are different genetic conditions. 

8. A method according to claim 7, wherein said second experimental conditions differ from said first 
experimental conditions in that the expression of a genetic element is expressed at a different level in 

5 said second experimental conditions relative to the level of expression of the genetic element in said 
first experimental conditions. 

9. A method according to claim 8, wherein said genetic element is heterologous to the specialized cell 
type. 

10. A method according to any one of the preceding claims, wherein the transcriptomes of the specialized 
10 cell types are compared by a technique involving hybridization to a nucleic acid array, subtractive 

mRNA hybridisation, the serial analysis of gene expression (SAGE); the selective amplification via 
biotin- and restriction-mediated enrichment (SABRE); differential display; representational difference 
analysis (RDA); differential screening of cDNA libraries; Northern blotting; an RNAse protection 
assay; an Sl-nuclease protection assays; RT-PCR; real time RT-PCR (Taq-man); EST sequencing; 
15 massively parallel signature sequencing (MPSS); or sequencing by hybridisation (SBH). 

11. A method according to claim 10, wherein the transcriptomes are compared by hybridization to a 
nucleic acid array. 

12. A substantially purified polypeptide, encoded by a gene implicated in a specific disease or 
physiological condition by a method according to any one of the preceding claims. 

20 13. A substantially purified polypeptide, which polypeptide: 

i) comprises the amino acid sequence as recited in any one of SEQ ID Nos: 1, 3, 5, 7, 9, 1 1, 
13, 15, 17,19,21,23,25,27,29,31,33,35,37,39,41, 43,45,47,49,51,53,55,57,59, 
63,67,69,73] 75, 77,85,87,89,91, 93, 95, 99, 103, 1 13, 1 15, 1 19, 121, 129, 131, 133, 
137, 139, 141, 145, 151, 153, 157, 159, 163, 169, 181, 187, 201,205,207 and 209; 

25 ii) has an amino acid sequence encoded by a nucleic acid sequence recited in any one of SEQ 

ID Nos: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 
48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 92a, 
94, 96, 98, 100, 102, 104, 106, 108, 1 10, 1 12, 1 14, 116, 118, 120, 122, 124, 126, 128, 
130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 

30 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 
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202, 204, 206, 208, 210, 212, 214 and 216, or has an amino acid sequence encoded 
by a gene identified from an EST recited in any one of these SEQ ID Nos; 

iii) is a fragment of a polypeptide according to i) or ii), provided that said fragment retains a 
biological activity possessed by the full length polypeptide of i) or ii), or has an antigenic 

5 determinant in common with the polypeptide of i) or ii); or 

iv) is a functional equivalent of a polypeptide of i), ii) or (iii). 

14. A polypeptide according to claim 13, wherein said biological activity is a hypoxia-regulated activity. 

15. A polypeptide according to claim 14, wherein the expression of the polypeptide is hypoxia-induced. 

16. A polypeptide according to claim 15, which polypeptide: 

10 i) comprises the amino acid sequence as recited in any one of SEQ ID Nos.: 1,3, 5, 7, 9, 1 1, 

13. 15, 17,19, 21, 23, 25, 27, 29, 31, 33, 63, 67, 69, 73, 75, 77, 85, 87, 89, 91, 93, 95, 99, 
103, 1 13, 115, 119, 121,129, 131, 133, 137,139 and 141; 

ii) has an amino acid sequence encoded by a nucleic acid sequence recited in any one of SEQ 
ID Nos.: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 64, 66, 68, 70, 72, 74, 

15 76, 78, 80, 82, 84, 86, 88, 90, 92, 92a, 94, 96, 98, 100, 102, 104, 106, 108, 1 10, 112, 1 14, 

1 16, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142 and 144, or is 
encoded by a gene identified from an EST recited in any one of these SEQ ID Nos.; 

iii) is a fragment of a polypeptide according to i) or ii), provided that said fragment retains a 
biological activity possessed by the full length polypeptide of i) or ii), or has an antigenic 

20 determinant in common with the polypeptide of i) or ii); or 

iv) is a functional equivalent of a polypeptide of i), ii) or (iii). 

17. A polypeptide according to claim 14, wherein the expression of the polypeptide is hypoxia-repressed. 

18. A polypeptide according to claim 17, which polypeptide: 

i) comprises the amino acid sequence as recited in any one of SEQ ID Nos.: 35, 37, 39, 41, 
25 43, 45, 47, 49, 51, 53, 55, 57, 59, 145, 151, 153, 157, 159, 163, 169, 181, 187, 201, 205, 

207 and 209; 

ii) has an amino acid sequence encoded by a nucleic acid sequence recited in any one of SEQ 
ID Nos.: 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 146, 148, 150, 152, 154, 156, 
158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 
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194, 196,198,200,202,204,206, 208, 210, 212, 214 and 216, or is encoded by a 
gene identified from an EST recited in any one of these SEQ ID Nos.; 

iii) is a fragment of a polypeptide according to i) or ii), provided that said fragment retains a 
biological activity possessed by the full length polypeptide of i) or ii), or has an antigenic 

5 determinant in common with the polypeptide of i) or ii); or 

iv) is a functional equivalent of a polypeptide of i), ii) or (iii). 

19. A polypeptide which is a functional equivalent according to part iv) of any one of claims 13-18, is 
homologous to the amino acid sequence as recited in any one of SEQ ID Nos: 1, 3, 5, 7, 9, 1 1, 13, 15, 
17, 19, 21 , 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 63, 67, 69, 73, 75, 

10 77,85, 87, 89, 91,93,95,99, 103, 1 13, 115, 119, 121, 129, 131, 133, 137, 139, 141, 145, 151, 153, 
157, 159, 163, 169, 181, 187, 201, 205, 207 and 209 or is homologous to the amino acid sequence 
encoded by a nucleic acid as recited in any one of SEQ ID Nos: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 
24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 
78, 80, 82, 84, 86, 88, 90, 92, 92a, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 

15 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 
162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 
202, 204, 206, 208, 210, 212, 214 and 216, and has equivalent biological activity to that possessed by 
the full length polypeptide of i) or ii). 

20. A fragment or functional equivalent according to any one of claims 13-19, which has greater than 50% 
20 sequence identity with the amino acid sequence as recited in any one of SEQ ID Nos: 1,3,5,7,9,11, 

13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 63, 67, 69, 
73,75,77,85,87,89,91,93,95,99,103,1 13,1 15,1 19,121,129,131,133,137,139,141,145,151, 
153, 157, 159, 163, 169, 181, 187, 201, 205, 207 and 209 or with the amino acid sequence that is 
encoded by a nucleic acid as recited in any one of SEQ ID Nos: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 

25 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 
78, 80, 82, 84, 86, 88, 90, 92, 92a, 94, 96, 98, 100, 102, 104, 106, 108, 110, 1 12, 1 14, 1 16, 1 18, 120, 
122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 
162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 
202, 204, 206, 208, 210, 212, 214 and 216, or with fragments thereof, preferably greater than 60%, 

30 70%, 80%, 90%, 95%, 98% or 99% sequence identity. 

21. A fragment as recited in any one of claims 13-20 having an antigenic determinant in common with a 
polypeptide according to part i) of any one of claims 13-18, which consists of 7 or more (for example, 
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8, 10, 12, 14, 16, 18, 20 or more) amino acid residues from the amino acid sequence as recited in 
any one of SEQ ID Nos: 1,3,5,7,9,1 1, 13, 15, 17, 19,21,23,25, 27,29,31,33,35,37,39,41,43, 
45,47,49,51,53,55,57,59,63,67,69,73,75,77,85,87,89,91,93,95,99,103,1 13,1 15,1 19,121, 
129, 131, 133, 137, 139, 141, 145, 151, 153, 157, 159, 163, 169, 181, 187, 201, 205, 207 and 209 or 
5 the amino acid sequence encoded by a nucleic acid as recited in any one of SEQ ID Nos: 2, 4, 6, 8, 10, 
12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 
66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 92a, 94, 96, 98, 100, 102, 104, 106, 108, 1 10, 1 12, 
114, 1 16, 1 18, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 
154, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 
10 194, 196, 198, 200, 202, 204, 206, 208, 210, 212, 214 and 216. 

22. A purified and isolated nucleic acid molecule that encodes a polypeptide according to any one of 
claims 13-21. 

23. A purified nucleic acid molecule according to claim 22, which consists of the nucleic acid sequence as 
recited in any one of SEQ ID Nos.: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 

15 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80; 82, 84, 86, 88, 90, 92, 
92a, 94, 96, 98, 100, 102, 104, 106, 108, 1 10, 112, 1 14, 1 16, 1 18, 120, 122, 124, 126, 128, 130, 132, 
134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 166, 168, 170, 172, 
174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 212, 
214 and 216, or is a redundant equivalent or fragment thereof. 

20 24. A purified nucleic acid molecule which hydridizes under high stringency conditions with a nucleic 
acid molecule according to claim 22 or claim 23. 

25. A vector comprising a nucleic acid molecule as recited in any one of claims 22-24. 

26. A delivery vehicle comprising a nucleic acid according to any one of claims 22-24 or a vector 
according to claim 25. 

25 27. A host cell transformed with a vector according to claim 25. 

28. An antagonist ligand which binds specifically to a polypeptide according to any one of claims 13-21, 
preferably a ligand which inhibits the hypoxia-induced activity of said polypeptide. 

29. An agonist ligand which binds specifically to a polypeptide according to any one of claims 13-21, 
preferably a ligand which augments or potentiates a hypoxia-induced activity of said polypeptide. 

30 30. A ligand according to claim 28 or claim 29, which is an antibody. 
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31. A ligand according to claim 28 or claim 29, which is a peptide, a peptidomimetic, or a drug 
molecule, such as a small natural or synthetic organic molecule of up to 2000Da, preferably 800Da or 
less. 

32. A polypeptide according to any one of claims 13-21, a nucleic acid molecule according to any one of 
5 claims 22-24, a vector according to claim 25 or a ligand according to claim 30 or 31, for use in therapy 

or diagnosis of disease. 

33. A polypeptide, nucleic acid molecule, vector or ligand as recited in claim 32, wherein said disease is a 
hypoxia-regulated condition. 

34. A polypeptide, nucleic acid molecule, vector or ligand as recited in claim 33, wherein said hypoxia- 
10 regulated condition is tumourigenesis, angiogenesis, apoptosis, inflammation, erythropoiesis, the 

biological response to hypoxia conditions (including processes such as glycolysis, gluconeogenesis, 
glucose transportation, catecholamine synthesis, iron transport or nitric oxide synthesis). 

35. A substantially purified polypeptide, which polypeptide: 

i) comprises the amino acid sequence as recited in any one of SEQ ID Nos: 1,3,5, 7, 9, II, 
15 13,15,17, 19,21,23,25,27,29,31,33,35,37,39,41,43,45,47,49,51, 53,55,57,59, 

63,67,69, 73,75,77,85,87, 89,91, 93, 95,99, 103, 113, 1 15, 119, 121, 129, 131, 133, 
137, 139, 141, 145, 151, 153, 157, 159, 163, 169, 181, 187,201,205, 207 and 209 or SEQ 
ID Nos.: 217, 219, 221, 223, 225, 227, 229, 231, 233, 235, 237, 239, 241, 243, 245, 247, 
249, 251, 253, 255, 257, 259, 261, 263, 265, 267, 269, 271, 273, 275, 277, 279, 281, 283, 

20 285, 287, 289, 291, 293, 295,297, 299, 301, 303, 305, 307, 309, 311, 313, 315, 317, 319, 

. 321, 323, 325, 327, 329, 331, 333, 335, 337, 339, 341, 343, 345, 347, 349, 351, 353, 355, 
357, 359, 361 , 363, 365, 367, 369, 371, 373, 375, 377, 379, 381, 383, 385, 387, 389, 391, 
393, 395, 397, 399, 401, 403, 405, 407, 409, 411, 413, 415, 417, 419, 421, 423, 425, 427, 
429,431,433,435,437,439,441,443, 445,447,449,451,453,455,457,459,461,463, 

25 465, 467, 469, 471, 473, 475, 477, 479, 481, 483, 485 

ii) has an amino acid sequence encoded by a nucleic acid sequence recited in any one of SEQ 
ID Nos: 2, 4, 6, 8,10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 
48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 92a, 
94, 96, 98, 100, 102, 104, 106, 108, 110, 1 12, 114, 1 16, 1 18, 120, 122, 124, 126, 128, 

30 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 

166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198,200, 
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202, 204, 206, 208, 210, 212, 214 and 216, or has an amino acid sequence encoded 
by a gene identified from an EST recited in any one of these SEQ ID Nos; 

iii) is a fragment of a polypeptide according to i) or ii), provided that said fragment retains a 
biological activity possessed by the full length polypeptide of i) or ii), or has an antigenic 

5 determinant in common with the polypeptide of i) or ii); or 

iv) is a functional equivalent of a polypeptide of i), ii) or (iii). 

for use in the diagnosis or therapy of the- disease or abnormal physiological condition that is 
affected by hypoxia, such as cancer, ischaemic conditions, reperfusion injury, retinopathy, 
neonatal stress, preeclapmsia, atherosclerosis, inflammatory conditions, wound healing,. 
10 tumourigenesis, angiogenesis, apoptosis, inflammation, erythropoiesis, hair loss, or the biological 

response to hypoxia conditions, including processes such as glycolysis, gluconeogenesis, glucose 
transportation, catecholamine synthesis, iron transport and nitric oxide synthesis. 

36. A purified and isolated nucleic acid molecule that encodes a polypeptide as recited in claim 35, for use 
in the diagnosis or therapy of for use in the diagnosis or therapy of a disease or abnormal physiological 

15 condition that is affected by hypoxia, such as cancer, ischaemic conditions, reperfusion injury, 
retinopathy, neonatal stress, preeclapmsia, atherosclerosis, inflammatory conditions, wound healing, 
tumourigenesis, angiogenesis, apoptosis, inflammation, erythropoiesis or hair loss. 

37. A purified nucleic acid molecule'as recited in claim 36, which consists of the nucleic acid sequence as 
recited in any one of SEQ ID Nos.: 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 

20 244, 246, 248, 250, 252, 254, 256, 258*, 260, 262, 264, 266, 268, 270, 272, 274, 276, 278, 280, 282, 
284, 286, 288, 290, 292, 294, 296, 298, 300, 302, 304, 306, 308, 310, 312, 314, 316, 318, 320, 322, 
324, 326, 328, 330, 332, 334, 336, 338, 340, 342, 344, 346, 348, 350, 352, 354, 356, 358, 360; 362, 
364, 366, 368, 370, 372, 374, 376, 378, 380, 382, 384, 386, 388, 390, 392, 394, 396, 398, 400, 402, 
404, 406, 408, 410, 412, 414, 416, 418, 420, 422, 424, 426, 428, 430, 432, 434, 436, 438, 440, 442, 

25 444, 446, 448, 450, 452, 454, 456, 458, 460, 462, 464, 466, 468, 470, 472, 474, 476, 478, 480, 482, 
484, 486 and 488, or which is a redundant equivalent or fragment thereof, for use in the diagnosis or 
therapy of a disease or abnormal physiological condition that is affected by hypoxia, such as cancer, 
ischaemic conditions, reperfusion injury, retinopathy, neonatal stress, preeclapmsia, atherosclerosis, 
inflammatory conditions, wound healing, tumourigenesis, angiogenesis, apoptosis, inflammation, 

30 erythropoiesis or hair loss. 

38. A purified nucleic acid molecule which hydridizes under high stringency conditions with a nucleic 
acid molecule as recited in claim 36 or claim 37, for use in the diagnosis or therapy of a disease or 
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abnormal physiological condition that is affected by hypoxia, such as cancer, ischaemic 
conditions, reperfusion injury, retinopathy, neonatal stress, preeclapmsia, atherosclerosis, 
inflammatory conditions, wound healing, tumourigenesis, angiogenesis, apoptosis, inflammation, 
erythropoiesis, or hair loss. 

5 39. A vector comprising a nucleic acid molecule as recited in any one of claims 36-38, for use. in the 
diagnosis or therapy of a disease or abnormal physiological condition that is affected by hypoxia, such 
as cancer, ischaemic conditions, reperfusion injury, retinopathy, neonatal stress, preeclapmsia, 
atherosclerosis, inflammatory conditions, wound healing, tumourigenesis, angiogenesis, apoptosis, 
inflammation, erythropoiesis, or hair loss. 

10 40. A ligand which binds specifically to, and which preferably inhibits the hypoxia-induced activity of, a 
polypeptide as recited in claim 35, for use in the diagnosis or therapy of tumourigenesis, angiogenesis, 
apoptosis, the biological response to hypoxia conditions, or a hypoxic-associated pathology. 

41. A pharmaceutical composition suitable for modulating hypoxia and/or ischaemia, comprising a . 
therapeutically-effective amount of a polypeptide as recited in any one of claims 13-21 or 35, a nucleic 

15 acid molecule according to any one of claims 22-24 or 36-38, a vector according to claim 25 or 39, or 
a ligand according to claim 30, 31 or 40, in conjunction with a p'harmaceutically-acceptable carrier. 

42. A pharmaceutical composition according to claim 41, wherein said pharmaceutically-acceptable 
carrier is a liposome. 

43. A vaccine composition comprising a polypeptide as recited in any one of claims 13-21 or 35, a nucleic 
20 ' acid molecule as recited in any one of claims 22-24 or 36-38, or a vector according to claim 25 or 39. 

44. A method of treating a disease in a patient in need of such treatment by administering to a patient a 
therapeutically effective amount of a polypeptide as recited in any one of claims 13-21 or 35, an 
antagonist of said polypeptide, or a nucleic acid molecule as recited in any one of claims 22-24 or 36- 
38. 

25 45. A method of regulating tumourigenesis, angiogenesis, apoptosis, the biological response to hypoxia 
conditions, or a hypoxic-associated pathology in a patient in need of such treatment by administering 
to a patient a therapeutically effective amount of a polypeptide according to any one of claims 13-21 or 
35, a nucleic acid molecule according to any one of claims 22-24 or 36-38, or a vector according to 
claim 25 or 39, or a ligand according to claim 30, 31 or 40 or a pharmaceutical composition according 

30 to claim 41 or 42. 
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46. A method according to claim 45, wherein, for diseases in which the expression of the natural 
gene or the activity of the polypeptide is lower in a diseased patient when compared to the level of 
expression or activity in a healthy patient, the polypeptide, nucleic acid molecule, ligand, compound or 
composition administered to the patient is an agonist. 

5 47. A method according to claim 45, wherein, for diseases in which the expression of the natural gene or 
activity of the polypeptide is higher in a diseased patient when compared to the level of expression or 
activity in a healthy patient, the polypeptide, nucleic acid molecule, vector, ligand, compound or 
composition administered to the patient is an antagonist. 

48. A polypeptide according to any one of claims 13-21 or 35, a nucleic acid molecule according to any 
10 one of claims 22-24 or 36-38, a vector according to claim 25 or 39, a ligand according to claim 30, 31 

or 40 or a pharmaceutical composition according to claim 41 or 42, for use in the manufacture of a 
medicament for the treatment of a hypoxia-regulated condition. 

49. A method of monitoring the therapeutic treatment of a disease or physiological condition in a patient, 
comprising monitoring over a period of time the level of expression or activity of polypeptide 

15 according to any one of claims 13-21 or 35, a nucleic acid molecule according to any one of claims 22- 
24 or 36-38, in tissue from said patient, wherein altering said level of expression or activity over the 
period of time towards a control level is indicative of regression of said disease. 

50. A method of providing a hypoxia regulating gene, an apoptotic or an angiogenesis regulating gene by 
administering directly to a patient in need of such therapy an expressible vector comprising expression 

20 control sequences operably linked to one or more of the nucleic acid molecules recited in claims 22-24 
or 36-38. 

51. A method of diagnosing a hypoxia-regulated condition in a patient, comprising assessing the level of 
expression of a natural gene encoding a polypeptide according to any one of claims 13-21 or 35, or 
assessing the activity of such a polypeptide, in tissue from said patient and comparing said level of 

25 expression or activity to a control level, wherein a level that is different to said control level is 
indicative of the hypoxia-related condition. 

52. A method according to claim 51 that is carried out in vitro. 

53. A method according to claim 51 or claim 52, which comprises the steps of: (a) contacting a ligand 
according to claim 30, 31 or 40 with a biological sample under conditions suitable for the formation of 

30 a ligand-polypeptide complex; and (b) detecting said complex. 

54. A method according to claim 51 or claim 52, comprising the steps of: 
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a) contacting a sample of tissue from the patient with a nucleic acid probe under 
stringent conditions that allow the formation of a hybrid complex between a nucleic acid 
molecule according to any one of claims 22-24 or 36-38 and the probe; 

b) contacting a control sample with said probe under the same conditions used in step a); and 

c) detecting the presence of hybrid complexes in said samples; 

wherein detection of levels of the hybrid complex in the patient sample that differ from levels of the 
hybrid complex in the control sample is indicative of the hypoxia-related condition. 

55. A method according to claim 51 orclaim 52, comprising the steps of: 

a) contacting a sample of nucleic acid from tissue of the patient with a nucleic acid primer under 

stringent conditions that allow the formation of a hybrid complex between a nucleic acid 
molecule according to any one of claims 22-24 or 36-38 and the primer; 

b) contacting a control sample with said primer under the same conditions used in step a); 

c) amplifying the sampled nucleic acid; and 

d) detecting the level of amplified nucleic acid from both patient and control samples; 
wherein detection of levels of the amplified nucleic acid in the patient sample that differ 
significantly from levels of the amplified nucleic acid in the control sample is indicative of the 
hypoxia-related condition. 

56. A method according to claim 51 or claim 52, comprising the steps of: 

a) obtaining a tissue sample from a patient being tested for the hypoxia-related condition; 

b) isolating a nucleic acid molecule according to any one of claims 22-24 or 36-38 from said 
tissue sample; and 

c) diagnosing the patient for disease by detecting the presence of a mutation which is 
associated with the hypoxia-related condition in the nucleic acid molecule as an indication 
of the hypoxia-related condition. 

57. The method of claim 56, further comprising amplifying the nucleic acid molecule to form an amplified 
product and detecting the presence or absence of a mutation in the amplified product. 

58. A method according to any one of claims 49-57, wherein said disease is cancer, ischaemic conditions, 
reperfusion injury, retinopathy, neonatal stress, preeclapmsia, atherosclerosis, inflammatory 
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conditions, wound healing, tumourigenesis, angiogenesis, apoptosis, inflammation, 
erythropoiesis, or hair loss. 

^ 59. A method according to claim 58, wherein said hypoxia or ischaem ia-related tissue damage is due to a 
disorder of the cerebral, coronary or peripheral circulation. 

5 60. A method according to any one of claims 49, and 54-59, wherein the tissue is a cancer tissue. 

61. A method for the identification of a compound that is effective in the treatment and/or diagnosis of 
disease, comprising contacting a polypeptide according to any one of claims 13-21 or 35, a nucleic 
acid molecule according to any one of claims 22-24 or 36-38 with one or more compounds suspected 
of possessing binding affinity for said polypeptide or nucleic acid molecule, and selecting a compound 

10 that binds specifically to said nucleic acid molecule or polypeptide. 

62. A method for the identification of a compound that is effective in the treatment and/or diagnosis of 
disease, comprising contacting a cell or cell membrane preparation comprising a polypeptide 
according to any one of claims 13-21 or 35 or a nucleic acid molecule according to any one of claims 
22-24 or 36-38 with one or more candidate compounds and detecting the degree of compound binding, 

15 or the stimulation or inhibition of a functional response in said cell or cell membrane. 

63. A compound identified or identifiable by a method according to claim 61 or claim 62. 

64. A compound according to claim 63, which is a natural or modified substrate, an enzyme, a receptor, a 
small organic molecule, such as a small natural or synthetic organic molecule of up to 2000Da, 
preferably 800Da or less, a peptidomimetic, an inorganic molecule, a peptide, a polypeptide, an 

20 antibody, or a structural or functional mimetics of any of these compounds. 

65. A kit useful for diagnosing disease comprising a first container containing a nucleic acid probe that 
hybridises under stringent conditions with a nucleic acid molecule according to any one of claims 22- 
24 or 36-38; a second container containing primers useful for amplifying said nucleic acid molecule; 
and instructions for using the probe and primers for facilitating the diagnosis of disease. 

25 66. The kit of claim 65, further comprising a third container holding an agent for digesting unhybridised 
RNA. 

67. An array of at least two nucleic acid molecules, wherein each of said nucleic acid molecules either 
corresponds to the sequence of, is complementary to the sequence of, or hybridises specifically to a 
nucleic acid molecule according to any one of claims 22-24 or 36-38. 
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68. An array according to claim 67, which contains nucleic acid molecules that either 
correspond to the sequence of, are complementary to the sequence of, or hybridise specifically to at 
163512,3,4,5,6,7,8,9,10,1 1,12,13, 14, 15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30, 
31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 

5 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 
85, 86, 87, 88, 89, 90, 91, 92, 92a, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 
108, 109, 110, 111, 1 12, 1 13, 114, 1 15, 1 16, 1 17, 1 18, 119, 120, 121, 122, 123, 124, 125, 126, 127, 
128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 
148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 

10 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 
188, 189, 190, 191, 192, 193, 194, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 
208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 226, 227, 
228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 243, 244, 245, 246, 247, 
248, 249, 250, 251, 252, 253, 254, 255, 256, 257, 258, 259, 260, 261, 262, 263, 264, 265, 266, 267, 

15 268, 269, 270, 271, 272, 273, 274, 275, 276, 277, 278, 279, 280, 281, 282, 283, 284, 285, 286, 287, 
288, 289, 290, 291, 292, 293, 294 or 295 of the nucleic acid molecules implicated in a hypoxia- 
regulated condition as recited in claims 22-24 or 36-38. 

69. An array according to any claim 67 or claim 68, wherein said nucleic acid molecules consist of 
between twelve and two thousand nucleotides. 

20 70. An array of antibodies, comprising at least two different antibody species, wherein each antibody 
species is immunospecific with a polypeptide implicated in a hypoxia-regulated condition as recited in 
any one of claims 13-21 or 35. 

71. An array of polypeptides, comprising at least two polypeptide species as recited in any one of claims 
13-21 or 35, wherein each polypeptide species is implicated in a hypoxia-regulated condition, or is a 

25 functional equivalent variant or fragment thereof. 

72. A kit comprising an array of nucleic acid molecules according to any one of claims 67-69. 

73. A kit comprising one or more antibodies that bind to a polypeptide as recited in any one of claims 13- 
21 or 35; and a reagent useful for the detection of a binding reaction between said antibody and said 
polypeptide. 

30 74. A transgenic or knockout non-human animal that has been transformed to express higher, lower or 
absent levels of a polypeptide according to any one of claims 13-21 or 35. 
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75. A method for screening for a compound effective to treat disease, by contacting a non- 
human transgenic animal according to claim 74 with a candidate compound and determining the effect 
of the compound on the disease or physiological condition of the animal. 

76. A substantially purified polypeptide comprising the consensus sequence! 
KAMVACYPGNGTGYVRHVDNPNGDGRCITCIYYLNKNWDAKLHGGILRIFPEGKSF1ADVEPI 
FDRLLFFWSDRRNPHEVQPSYATRYAMTVWYFDAEERAEAKKK, or a variant thereof. 

77. A substantially purified polypeptide according to claim 76, for use in the diagnosis or treatment of a 
hypoxia-related disease or condition. 
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FIG. 32e 
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FIG. 53a 

p1E10/SeqlD:72 
cDNA FLJ11041 fis, clone PLACE1 004405 
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FIG. 53b 

p1D24/SeqlD:118 
EST 
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FIG. 53c 

p1E7/SeqlD:84 
Novel metallothionein 
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FIG. 53d 

p1F6/SeqlD:338 
Hypothetical protein hqp0376 
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FIG. 53e 

p1E22/SeqlD:162 
cDNA FLJ13618 fis, clone PLACE1010925 
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FIG. 53f 

p1P14/SeqlD:92 
Semaphorin 4b 
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FIG. 53g 

p1F17/SeqlD:330 
P8 protein (candidate of metastasis 1) 
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p1E1/SeqlD:124 
EST 
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FIG. 54b 



p1D18/SeqlD:128 
cDNA FLJ13443 fis, clone PLACE1 002853 
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FIG. 54c 

p1F9/Seq(D:20 
Hypothetical protein KIAA0742 
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FIG. 54d 

p1D1/SeqlD:24 
Hypothetical protein FLJ10134 



CD 
> 
CD 



DC 

E 

"D 
CD 
CO 

as 
E 

1— 

o 
c 




55 

X 

o 

E 
i— 

o 

CD 

c 
o 
c 



53 
x 
o 

CL 

CD 
C 

o 
c 



CO 
X 

o 

E 
i— 

o 

c 



<5 

X 

o 

Q- 
>^ 



cti 

X 

o 

E 

o 
c 

CO 
CL 



CO 
X 

o 

CL 

x: 

C/) 
CL 



WO 02/46465 



PCT/G BO 1/05458 



76/96 



4.5 
4.0 
3.5 
3.0 
2.5 
2.0 
1.5 
1.0 
0.5 



FIG. 54e 

p1F8/ SeqID:10 
Hypothetical protein KIAA0914 
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FIG. 54f 

p1D16/ SeqlD:34 
Hypothetical protein FLJ20308 
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FIG. 54g 



p1F3/SeqlD:334 
Hypothetical protein XP_017131 
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FIG. 54h 

p1D12/SeqlD:30 
Hypothetical protein KIAA1376 
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FIG. 55a 

p1D9/SeqlD:28 
Hypothetical protein DKFZP564D116 
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FIG. 55b 

pH15/SeqlD:48 
Hypothetical protein CGI-117 
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FIG. 55c 

p1l4/ SeqlD:54 
Hypothetical protein HSPC1 96 
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FIG. 55d 

p1E13/SeqlD:22 
Hypothetical protein PRO0823 
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FIG. 55e 

p1F10/ SeqlD:6 
Hypothetical protein DKFZp434P0116 
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p1l2/ SeqlD:150 
cDNA FLJ11302 fis, clone PLACE1 009971 
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FIG. 55g 



p1l5/SeqlD:42 
Hypothetical protein FLJ 1 08 1 5 
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FIG. 55h 

p1G20/SeqlD:204 
cDNA YO23H03 
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FIG. 56a 

p1G5/SeqlD:280 
MAX-interacting protein 1 
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FIG. 56b 

p1D22/ SeqlD:120 
MAX-interacting protein 1 
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FIG. 56c 

p1G17/SeqlD:316 
Early development regulator 2 
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FIG. 56d 

p1G9/SeqlD:306 
PI-3-kinase, catalytic, beta polypeptide 
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FIG. 56e 

p1K22/SeqlD:420 
GPR44 
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FIG. 56f 

p1C10/SeqlD:376 
Regulator of G-protein signalling 1 
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FIG. 56g 

p1D6/SeqlD:68 
ER01 (S. cerevisiae)-like 
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